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roENTOlCATION/AUTHENTICATION CODING METHOD AND APPARATUS 

Field of the Invention 

The present invention relates to the embedding of robust identification codes in 

electronic, optical and physical media, and the subsequent, objective discernment of such codes for 

5 identification purposes even after intervening distortion or corruption of the media. 

The invention is illustrated with reference to sevml exemplary applications, 

including idratification/authentication coding of electronic imagery, serial data signals (e.g. audio and 

video), emulsion film, and paper cuirency, but is not so limited. 

Background and Summary of the Invention 

10 "/ would never put it in the power of any printer or publisher 

to stq>press or alter a work of mine, by making him master of 
the copy 

Thomas Paine, Rights of Man, 1792. 
15 Ihe printer dares not go beyond his licensed copy"* 

Milton, Aeropagetica, 1644. 

Since time immonorial, unauthorized use and outright piracy of proprietary 
source material has been a soim;e of lost revenue, confusion, and artistic corruption. 

20 These historical problems have been compounded by the advent of digital 

technology. With it, the technology of copying materials and redistributing them in unauthorized 
manners has reached new heights of sophistication, and more importantly, omnipresence. Lacking 
objective means for comparing an alleged copy of material with the origmal, owners and possible 
litigation proceedings are left with a subjective opmion of whether the alleged copy is stolen, or 

25 has been used in an unauthorized manner. Furthermore, ihm is no simple means of tracing a path 
to an original purchaser of the material, something which can be valuable in tracmg whoe a 
possible "leak" of the material first occurred. 

A variety of methods for protecting commercial material have been attempted. 
One is to scramble signals via an encoding method prior to distribution, and descrambie prior to 

30 use. lliis technique, however, requires that both the origmal and later descrambled signals never 
leave closed and controlled networks, lest they be intercepted and recorded. Furthermore, this 
arrangement is of little use in the broad field of mass marketing audio and visual material, where 
even a few dollars extra cost causes a major reduction in market, and where the signal must 
eventually be descrambled to be perceived, and thus can be easily recorded. 

35 Another class of techniques relies on modification of source audio or video 

signals to include a subliminal identification signal, which can be sensed by electronic means. 
Examples of such systems are found in U.S. Patent 4,972,471 and European patent publication EP 
441,702, as well as in Komatsu et al, "Authentication System Using Concealed Image in 
Telematics," Memou^ of the School of Science & Engineering, Waseda University, No. 52, p. 45- 

40 60 (1988) (Komatsu uses the term "digital watermark" for this technique). An elementary 
introduction to these methods is found in the article ^'Digital Signatures," Byte Magazine, 
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November, 1993. p. 309. These tecfaniques have the common characteristic that deteiministic 
signals with well defined patterns and sequences withm the souice material conv<?y the 
identification information. For certain applications this is not a drawback. But in general, this is 
an inefficient form of embedding identification information for a variety of reasons: (a) the whole 
of the source material is not used; (b) determmistic patterns have a higher likelihood of being 
discovered and removed by a would-be phate; and (c) the signals are not generally 'holographic' 
in that identifications may be difficult to make given only sections of the whole. ('Holographic' 
is used herein to refer to the pn^erty that the identification mformation is distributed globally 
throughout the coded signal, and can be folly discerned fitmi an exammation of even a fiaction of 
the coded signal. Codmg of this type is sometimes termed "distributed" herein.) 

Among the cited refmnces are desmptions of several programs which perfram 
steganography - described m one document as "... the ancient art of hiding mfoimation m some 
otherwise inconspicuous information." These programs variously aUow computer users to hide 
their own messages inside digital image files and digital audio files. All do so by togglmg the 
least significant bit (the lowest order bit of a single data sample) of a given audio data stream or 
rastenzed image. Some of these programs embed messages quite directly into the least significant 
bit, while other "pre-encrypt" or scramble a message first and then embed the encrypted data mto 
the least significant hit. 

Our current understandmg of these programs is that they generaUy rely on 
error-fiec transmission of the of digital data in order to conectiy transmit a given message in its 
entirety. Typically the message is passed only once. i.e, it is not repeated. These programs also 
seem to "take over" the least significant bit entirely, where actual data is obliterated and the 
message placed accordingly. This might mean that such codes could be easily erased by merely 
strippmg off the least significant bit of aU data values in a given nnage or audio file. It is these 
and other considerations which suggest that the only similarity between our invention and the 
established art of steganography is in the placement of information mto data files with minimal 
perceptibility. The specifics of embeddmg and the uses of that buried information diverge from 
there. 

Another cited reference is U.S. Patent 5.325,167 to Melen. to the service of 
authenticating a given docmnent. the high precision seamimg of that document reveals patterns and 
"microscopic grain structure" which apparently is a kind of miique fingerprint for the miderlymg 
docmnent media, such as paper itself or post-applied materials such as toner. Melen fiirther 
teaches that scamiing and storing this fmgetprint can later be used in authentication by scamimg a 
purported document and comparing it to the original fmgerprint. Applicant is aware of a shnUar 
idea employed m the very high precision recording of credit card magnetic strips, as reported m 
the February 8, 1994. WaU Sdeet Jomnal. page Bl. wherein very fme magnetic fluxuations tend 
to be miique from one card to the next, so that credit card authentication could be achieved 
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through pre-recording these iluxuations later to be compared to the recordings of the purportedly 
same credit card. 

Both of the foregoing techniques appear to rest on the same identification 
principles on which the mature science of fingerprint analysis rests: the innate uniqueness of some 
5 localized physical property. These methods then rely upon a single judgement and/or 

measurement of "similarity" or "correlation" between a suspect and a pre-recording master. 
Though fingerprint analysis has brought this to a high art, these methods are nevertheless open to 
a claim that pr^arations of the samples, and the "filtering" and "scanner specifications" of 
Melen's patent, unavoidably tend to bias the resulting judgement of similarity, and would create a 

10 need for more esoteric "expert testimony" to explain &e confidence of a found match or 

mis-match. An object of tiie presoit invention is to avoid this reliance on expert testimony and to 
place the confidence in a match into simplt "com flip" vernacular, i.e., what are the odds you can 
call the correct coin flip 16 times in a row. Attempts to identify fi-agments of a fingerprint^ 
document, or otherwise, exacerbate this issue of confidence in a judgment, where it is an object of 

15 the present invention to objectively apply the intuitive "coin flip" confidence to the smallest 

fragment possible. Also, storing unique fingerprints for each and every document or credit card 
magnetic strip, and having these fingerprints readily available for later cross-cheddng, ^ould 
prove to be quite an economic undertaking; It is an object of this invention to allow for the 
"re-use" of noise codes and "snowy images" in the service of easing storage requirements. 

20 U.S. Patoit 4,921,278 to Shiang et al. teadies a kind of spatial encryption 

technique wherem a signature or photograph is spiked out into what the untrained eye would 
refer to as noise, but which is actually a well defined structure referred to as Moire patterns. The 
similarities of the present invention to Shiang' s system appear to be use of noise-like patterns 
which nevertheless cany information, and the use of this principle on credit cards and other 

25 identification cards. 

Others of the cited patents deal with other techniques for identification and/or 
authentication of signals or media. U.S. Patent 4,944,036 to Hyatt does not appear to be 
qsplicabie to the present invention, but does point out diat die tmn "signature" can be equally 
qsplied to signals v^ch carry unique characteristics based on physical structure. 

30 Despite the foregomg and other diverse work in the field of 

identification/authentication, there still remains a need for a reliable and efficient method for 
performing a positive identification between a copy of an original signal and the original. 
Deskably, this method should not only perform identification, it should also be able to conv^ 
source-version information in order to better pinpoint the point of sale. The method shoiild not 

35 compromise the iimate quality of material which is being sold, as does the placement of localized 
logos on images. The method should be robust so that an identification can be made even after 
multiple copies have been made and/or compression and decompression of the signal has taken 
place. Hie identification method should be largely uneraseable or "uncrackable.** The method 
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should be capable of working even on fractional pieces of the original signal, sudi as a 10 seccmd 
"rifir of an audio signal or the "clipped and pasted" sub-section of an origmal image. 

The existence of such a method would have profound consequences on piracy in 
that it could (a) cost effectively monitor for unauthorized uses of material and p^form "quick 
5 checks-; (b) become a deterrent to unauthorized uses when the method is known to be in use and 
the consequences weU publicized; and (c) provide unequivocal proof of identity, similar to 
fingaprint identification, in litigation, with potentially more reliability than that of fingerprinting. 

In accordance with an exemplaiy mbodiment of the invention, the foregoing and 
additional objects are achieved by anbedding an imperceptible identification code throughout a 
10 source signal. In the preferred onbodiment, this embedding is achieved by modulating the source 
signal with a small noise signal in a coded fashicm. More particularly, bits of a binary 
identification code are referenced, one at a time, to control modulation of the source signal with 
the noise signal. 

Ihe copy with the embedded signal (the "encoded" copy) becomes the material 
15 which is sold, while the original is secured m a safe place. The new copy is nearly identical to 
the original except undo- the finest of scrutiny; thus, its commercial value is not compromised. 
After the new copy has been sold and distributed and potentially distorted by multiple copies, the 
present disclosure detaik methods for positively identifying any suspect signal against the original. 

Among its other advantages, the preferred embodiments* use of identification 
20 signals which are global Qiolographic) and which mimic natural noise sources allows the 

maximization of identification signal energy, as opposed to merely having it present 'somewhere 
in the origmal material.' This allows the identification codmg to be much more robust in the face 
of thousands of real world degradation processes and material transformations, such as cuttmg and 
cropping of imagery. 

foregoing and additional features and advantages of the present invention 
will be more readily apparent fit«n the following detailed description thereof, which proceeds with 
reference to the accompanying drawings. 

Brief Description of the Drawings 
Fig. 1 is a simple and classic depiction of a one dimensional digital signal which 
30 is discretized in both axes. 

Fig. 2 is a general overview, with detailed description of steps, of the process of 
embedding an "imperceptible" identification signal onto another signal. 

Fig. 3 is a step-wise description of how a suspected copy of an original is 

identified. 

Fig. 4 is a schematic view of an apparatus for pre-exposing fihn with 
identification information in accordance with another embodiment of the present invention. 

Fig. 5 is a diagram of a "black box" embodiment of the present invention. 
Fig. 6 is a schematic block diagram of the embodhnent of Fig. 5. 
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Fig. 7 shows a variant of the Fig. 6 embodiment adapted to encode successive 
isets of input data with di£ferent code words but with the same noise data. 

Fig. 8 shows a variant of the Fig. 6 embodiment adi^ted to encode each frame 
of a videotaped production with a unique code number. 
5 Fig3. 9A-9C are representations of an industry standard noise second that can be 

used in one embodimeiit of the present invention. 

Fig. 10 shows an integrated circuit used in detecting standard noise codes. 
Fig. 11 shows a process flow for detecting a standard noise code that can be used 
in the Fig. 10 embodiment. 
10 Fig. 12 is an mbodiment employing a plurality of detectors in accordance with 

another embodiment of the presmt mvention. 

Detailed Description 

In the following discussion of an illustrative embodiment, the words "signal" and 
"image" are used interchangeably to refer to both one, two, and even beyond two dimensions of 

15 digital signal. Examples will routinely switch back and forth between a one dimensional 
audio-type digital signal and a two dimensional unage-type digital signal. 

In order to fuUy describe the details of an illustrative embodiment of the 
invention, it is necessary first to describe the basic properties of a digital signal. Fig. 1 shows a 
classic representation of a one dimensional digital signal. Ihe x-axis defines the index numbers of 

20 sequence of digital **samples," and the y-axis is the instantaneous value of the signal at that 

sample, being constrained to exist only at a finite numbo' of levels defined as the "binary dq)th" 
of a digital sample. Hie example depicted in Fig. 1 has the value of 2 to the fourth power, or M 
bits," giving 16 allowed states of the sample value. 

For audio information such as sound waves, it is commonly accepted that the 

25 digitization process discretizes a continuous phenomena both in the tune domain and in the signal 
level domain. As such, the process of digitization itself introduces a fundamental error source, in 
that it cannot record detail smaller than the discretization interval in either domain. The industry 
has referred to this, among other ways, as **aliasing" in the time domain, and "quantization noise" 
in the signal level domain. Hius, there will always be a basic error floOT of a digital signal. Pure 

30 quantization noise, measured in a root mean square sense, is theoretically known to have the value 
of one over the square root of twelve, or about 0.29 DN, y/here DN stands for 'Digital Number' 
or the finest unit increment of the signal level. For example, a perfect 12-bit digitizer will have 
4096 allowed DN with an iimate root mean square noise floor of ~029 DN. 

All known physical measurement processes add additional noise to the 

35 transformation of a continuous signal into the digital form. The quantization noise typically adds 
in quadrature (square root of the mean squares) to the "analog noise" of the measurement process, 
as it is sometimes referred to. 
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With almost all commercial and technical processes, the use of the decibel scale 
is used as a measm-e of signal and noise in a given recording medium. The expression "signal-to- 
noise ratio" is generally used, as it wiU be in this disclosure. As an example, this disclosure refers 
to signal to noise ratios m terms of signal power and noise power, thus 20 dB represents a 10 
5 times increase in signal amplitude. 

In summary, the presently preferred embodiments of the* invention embed an 
N-bit vahie onto an entire signal through tiie addition of a voy low amplitude encodation signal 
which has the look of pure noise. N is usually at least 8 and is capped on the higher raid by 
ultimate signal-to-noise considerations and "bit error" m retrieving and decoding the N-bit value. 
10 As a practical matter, N is chosoi based on application specific considerations, such as the number 
of unique different "signatures" that are desired. To Olustrate, if N=128, then the number of 
unique digital signatures is in excess of 10^^38 (2^^128). Hiis number is believed to be more 
than adequate to both identify the material with sufficient statistical certainly and to index exact 
sale and distribution information. 

The amplitude or power of this added signal is determined by the aesthetic and 
informational considerations of each and every qjplication using the present methodology. For 
instance, non-professional video can stand to have a higher embedded signal level without 
becommg noticeable to the average human eye. while high precision audio may only be able to 
accept a relatively small signal level lest &e human ear perceive an objectionable increase m 
20 "hiss." These statements are generalities and each application has its own set of criteria in 

choosing the signal level of the embedded identification signal. Hie higher the level of embedded 
signal, the more corrupted a copy can be and still be identified. On the other hand, the higher the 
level of embedded signal, the more objectionable the perceived noise might be, potentially 
impacting the value of the distributed material. 

To illustrate the range of different ^plications to which the principles of the 
present invention can be applied, the present specification details two different systems. The fnst 
(termed, for lack of a better name, a "batch encoding" system), ^lies identification coding to an 
existing data signal. Hie second (tHmed, for lack of a better name, a "real time encoding" 
system), applies identification coding to a signal as it is produced Those skilled in the art will 
30 recognize that the principles of the present invoition can be ^lied in a number of other contexts 
in addition to these particularly described. 

TTie discussions of these two systems can be read in either order. Some readers 
may fmd the latter more intuitive than the former; for others the contrary may be true. 
BATCH ENCODING 

35 The following discussion of a fu-st class of embodiments is best prefaced by a 

section defining relevant terms: 

The original signal refers to either the original digital signal or the high quality 
digitized copy of a non-digital original. 
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Uie N-hit identification word refers to a unique identification binary value, 
typically having N range anywhere from 8 to 128, which is the identification code ultimately 
placed onto the original signal via the disclosed transformation process. In the Ulustrated 
embodiment, each N-bit identification word begins with the sequence of values '0101,' which is 
5 used to determine an optimization of the signal-to-noise ratio in the identification procedure of a 
subject signal (see definition below). 

The m'lh bit value of the N-bit identification w«d is either a zero or one 
coiresponding to the value of the m'lh place, reading kft to right, of the N-bit word. Eg, the 
first (m=l) bit vatae of the N=8 identification word 01110100 is the value '0;' the second bit 
10 vatoe of tiiis identification wwd is ' 1', etc. 

The m'fli individual embedded code signal refers to a signal which has 
dunensions and extent precisely equal to the original signal (e.g. both are a 512 by 512 digital 
unage), and whidi is (in the illustrated embodiment) an independent pseudo-random sequence of 
digital values. -Pseudo" pays homage to tiie difficulty in philosophically defming pure 
15 randomness, and also indicates that there are various acceptable ways of generating the "random- 
signal. There win be exactly N individual embedded code signals assodated with any givm 
original signal. 

TTie acjcentable perceived noise level tefiers to an qjpUcatian-qiedfic 
determination of how much "extra noise." i.e. amplitude of the composite embedded code signal 
20 described next, can be added to flie original signal and stiB have an acceptable signal to sell or 
otherwise distribute. This disclosure uses a 1 dB increase in noise as a typical value which might 
be accq)table, but this is quite arbitrary. 

■nie cnmposite embedded code signal refers to the signal which has dimensions 
and extent precisely equal to the original signal, (e.g. ooth are a 512 by 512 digital image), and 
25 which contains the addition and ^propriate attenuation of the N individual embedded code 
signals. The individual embedded signals are generated on an arbitrary scale, whweas the 
amplitude of the composite signal must not exceed the pre-set acceptable perceived noise level, 
hence the need for "attenuation" of the N added individual code signals. 

The distributable signal refers to the nearly similar copy of the original signal, 
30 consistmg of the original signal plus the composite embedded code signal. This is the signal 
which is distributed to the outside community, having only slightly higher but acceptable "noise 

properties" than the original. 

A suspect signal refers to a signal which has flie general qjpearance of the 
original and distributed signal and whose potential identification matdi to die original is being 
35 questioned. Hie suspect signal is then analyzed to see if it matches tiie N-bit identification word. 

The detailed methodology of this first embodiment begins by stating tiiat the 
N-bit identification word is encoded onto the original signal by having each of the m bit values 
multiply tiieir corresponding individual embedded code signals, tfie resultant being accumulated in 
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the composite signal, the fully sunimed composite signal then bang attenuated down to fte 
acceptable perceived noise amplitude, and the resultant composite s^ added to the origina] to 
become the distributable signal. 

TTie original signal, the N-bit identification word, and all N individual embedded 
5 code signals are then stored away in a secured place. A suspect signal is then found. This signal 
may have undetgone multiple copies, compressions and decompressions, resamplings onto different 
spaced digital sigmils. transfers from digital to amdog back to digital media, or any combination of 
theseitems. EE Ac signal stfll q)pears similar to the aiginal, ie. its innate qualiQr is not 
thoroughly destroyed by all of these tnmsfbnnations and noise additions, then depending on the 
10 signal to noise properties of the embedded signal, the identification process should function to 

some objective degree of statistical confidence. TTie extent of conuption of the suspect signal and 
the original acceptable perceived noise level are two key parameters in determining an expected 
confidence level of identification. 

The identification process on the suspected signal begins by resampling and 
15 aligning the suspected signal onto the digital format and extent of the original signal. Thus, if an 
image has been reduced by a fi»tor of two. it needs to be digitally enlarged by that same factor. 
Likewise, if a piece of music has been "cut out," but may still have the same sampling rate as the 
origfaal. it is necessary to register this cut-out piece to the original, typically done by perfonning a 
local digital cross^oirelation of the two signals (a common digital operation), finding at what 
delay value the correlation peaks, then using this finmd delay value to register the cut piece to a 
segment of the original. 

Once the suspect signal has been sample-spadng matdied and n^stered to the 
original, the signal levels of the suspect signal should be matched in an rms sense to the signal 
level of the original. This can be done via a search on the parameters of ofiset, amplification, and 
! gamma being optimized by using the minimum of the mean squared error between the two signals 
as a function of the three parameters. We can call the suspect signal normalized and registered at 
this pomt, or just normalized for convmience. 

The newly matched pair then has the original signal subtracted frwn the 
normalized suspect signal to produce a difference signal. The difference signal is then 
cross-correlated with each of the N individual embedded code signals and the peak 
cross-correlation value recorded. TTie first four bit code ('0101') is used as a calibrator both on 
the mean values of the zero value and the one value, and on fiirther registration of the two sigmils 
if a fma- signal to noise ratio is desired (i.e., the optimal separation of the 0101 signal will 
indicate an optimal registration of the two signals and will also indicate the probable existence of 
the N-bit identification signal being present.) 

The resultmg peak caross-correlation values will form a noisy series of floating 
point numbers which can be transformed into O's and I's by their proximity to the mean values of 
0 and 1 found by the 0101 calibration sequence. If the suspect signal has indeed been derived 
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from the original, the identification number resulting from the above process will match the N-bit 
identification word of the original, bearing in mind either predicted or unknown "bit error" 
statistics. Signal-to-noise considerations will determine if there will be some kind of "bit error" in 
the identification process, leading to a form of X% probability of identification where X might be 
5 desired to be 99.9% or whatever. If the suspect copy is indeed not a copy of the original, an 
essentially random sequence of O's and Vs will be produced, as well as an apparent lack of 
separation of the resultant values. Tliis is to say» if the resultant values are plotted on a histogram, 
the existence of the N-bit identification signal will exhibit strong bi-level diaractedstics, whereas 
the non-existence of the code, or the existence of a difforent code of a different original, will 
10 exhibit a type of random gaussian-like distribution. This histogram separation alone should be 
sufficient for an identification, but it is even stronger proof of identification when an exact binary 
sequence can be objectively rqiroduced. 
Specific Example 

hnagine that we have taken a valuable picture of two heads of state at a cocktail 
15 party, pictures which are sure to earn some reasonable fee in the commercial maricet. We desire 
to sell this picture and ensure that it is not used in an imautfaorized or uncompensated manner. 
Hiis and the following steps are summarized in Fig. 2. 

Assume the picture is transfonned into a positive color print We first scan this 
into a digitized form via a normal high quality black and white scanner with a typical photometric 
20 spectral response curve. (It is possible to get better ultimate signal to noise ratios by scanning in 
each of the three primary colors of the color image, but this nuance is not c^tral to describing the 
basic process.) 

Let us assume that the scanned image now becomes a 4000 by 4000 pixel 
monochrome digital image with a grey scale accuracy defined by 12-bit grey values or 4096 
25 allowed levels. We will call this the "original digital image" realizing that this is the same as our 
"original signal" in the above definitions. 

During the scanning process we have arbitrarily set absolute black to correspond 
to digital value *30*. We estimate that there is a basic 2 Digital Number root mean square noise 
existing on the original digital image, plus a theoretical noise Qcnown in the industry as "shot 
30 noise") of the square root of the brightness value of any given pixel. In formula, we have: 

<RMS Noise^> - sqrt(4 + (V.-30)) (1) 

Here, n and m are simple indexing values on rows and colmnns of the image ranging from 0 to 
35 3999. Sqrt is the square root. V is the DN of a given indexed pixel on the original digital image. 
The < > brackets around the RMS noise merely indicates that this is an expected average value, 
where it is clear that each and every pixel will have a random error individually. Thus, for a pixel 
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value having 1200 as a digital number or "brightness value", we find that its &q)ected rms noise 
value is sqrt(1204) = 34.70, which is quite close to 34.64. the square root of 1200. 

We furthermore realize that the square root of the innate brightness value of a 
pixel is not precisely what the eye perceives as a minimum objectionable noise, thus we come up 
with the formula: 

<RMS Addable Noise^> = X*sqrt(4-KV^.30)^Y) (2) 

Where X and Y have been added as anpirical parameters which we will adjust, and "addable" 
noise refers to our acceptable perceived noise level from the definitions above. We now intend to 
experiment with what exact value of X and Y we can choose, but we wiU do so at the same time 
that we are performing the next steps in the process. 

The next step in our process is to choose N of our N-bit identification word. We 
decide that a 16 bit main identification value with its 65536 possible values will be sufficiently 
large to identify the image as ours, and that we will be directly seUing no more than 128 copies of 
the image which we widi to track, giving 7 bits plus an eighth bit for an odd/even adding of the 
first 7 bits (i.e. an error dieddng bit on tiie first seven). The total bits required now are at 4 bits 
for the 0101 calibration sequence, 16 for the main identification, 8 for the version, and we now 
throw in anotha- 4 as a further error checking value on the first 28 bits, givmg 32 bits as N. The 
final 4 bits can use one of many industry standard error checking mediods to choose its four 
values. 

We now randomly determine the 16 bit main identification number, finding for 
example, 1 101 0001 1001 1 1 10; our first versions of the original sold will have all O's as the 
version identifier, and the error checking bits will fall out where they may. We now have our 
25 unique 32 bit identification word which we will embed on the original digital image. 

To do this, we genmte 32 independent random 4000 by 4000 encoding images 
for each bit of our 32 bit identification word. The manner of generating these random images is 
revealmg. There are numerous ways to generate these. By far the simplest is to turn up the gain 
on tiie same scanner that was used to scan m the origmal photograph, only this time placing a pui« 
black image as the input, then scanning this 32 times. Tlie onfy drawback to this technique is that 
it does require a large amount of memory and that "fixed pattern" noise will be part of each 
independent "noise image." But, the fixed pattern noise can be removed via normal "dark frame" 
subtraction techniques, Assume that we set the absolute black average value at digital number 
*100,' and that rather than finding a 2 DN rms noise as we did in the normal gain setting, we now 
35 fmd an rms noise of 10 DN about each and every pixel's mean value. 

We next apply a mid-spatial-frequency bandpass filter (spatial convolution) to 
each and every independent random image, essentially removing the very high and the very low 
spatial frequencies from them. We remove the very low fi^uencies because sunple real-worid 
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enor sources like geometrical waipiiig. splotches on scanners, mis-registrations, and the like will 
exhibit themselves most at lower fiequencies also, and so we want to concentrate our identification 
signal at higher spatial frequencies in order to avoid these types of corruptions. Likewise, we 
remove the higher fiequencies because multiple generation copies of a given image, as well as 
compression-decompression transformations, tend to wipe out higher frequencies anyway, so there 
is no point in placing too much identification signal into these frequencies if they wiD be the ones 
most prone to being attenuated. Therefoie, our new filtered indepemient noise images wiU be 
dominated by mid-spatial fiequencies. On a practical note, since we are using 12-bit vataes on 
our seamier and we have removed the DC value eflfectivdy and our new ims noise wffl be slightly 
less than 10 digital numbers, it is usefiil to boil this down to a 6-bit value ranging from -32 
through 0 to 31 as the resultant random image. 

Next we add all of the random images together which have a ' 1' in their 
corresponding bit value of the 32-bit identification word, accumulating the result in a 16-bit signed 
integer image. This is the unattenuated and un-scaled version of the composite embedded signal. 
]5 Next we experiment visually with adding flie composite anbedded signal to the 

original digital image, through varying the X and Y parameters of equation 2. In formula, we 
visually iterate to both maximize X and to find Ae q^iropriate Y in Ae following: 



10 



20 



35 



Vi_-V^ + V *X*sqrt(4+V^AY) (3) 



where dist refos to the candidate distributable hnagc, Le. we are visually iterating to find what X 
and Y will give us an acceptable image; orig refers to the pfacel value of the original image; and 
comp refers to the pbcel value of the composite image. He n's and m's still index rows and 
columns of the image and indicate that this operation is done on all 4000 by 4000 pixels. The 
25 symbol Vis the DN of a given pfacel and a given image. 

As an arbitraiy assumption, now, we assume that our visual experimentation has 
found that the value of X= 0.025 and Y=0.6 are acceptable values when comparing the original 
image with the candidate distributable image. This is to say, the distributable fanage with the 
-extra noise" is acceptably close to Ae original m an aesthetic sense. Note that since our 
30 individual random images had a random rms noise value around 10 DN, and that adding 

approximately 16 of these images together will increase the composite noise to around 40 DN, the 
X multiplication value of 0.025 will bring the added rms noise back to around 1 DN, or half the 
ampUtude of our innate noise on the original. This is roughly a 1 dB gain in noise at the dark 
pixel values and correspondingly more at the brighter values modified by the Y value of 0.6. 

So with these two values of X and Y, we now have craistructed our first versions 
of a distributable copy of the original. Other versions wiU merely create a new composite signal 
and possibly change the X slightly if deemed necessary. We now lock up the original digital 
image along with the 32-bit identification word for each version, and the 32 independent random 



wo 95/14289 



PCT/US94/13366 



-12- 



4-bit images, waiting for our first case of a suspected piracy of our original. Storage wise, this is 
about 14 Megabytes for the original image and 32*0.5bytes*16 million = -256 Megabytes for the 
random individual encoded images. TWs is quite acceptable for a single valuable image. Some 
storage economy can be gained by simple lossless compression. 
Finding a Susoected Piiacv of our Imap;e 

We sen our image and several months later find our two heads of state in die 
exact poses we sold them in, seemingly cut and lifted out of our image and placed into another 
stylized badcground scene. Ihis new "suspect" image is bemg printed in 100,000 copies of a 
given magazine issue, let us say. We now go about determining if aportion of our orighial image 
has indeed been used in an unauthwized manner. Fig. 3 summarizes the details. 

The fust step is to take an issue of the magazme, cut out the page with the 
image on it, then carefully but not too carefully cut out the two figures fiom the background 
image using ordinary scissors. If possible, we will cut out only one connected piece rather than 
the two figures separately. We paste this onto a black background and scan this into a digital 
fonn. Next we electronically flag or mask out the black background, which is easy to do by 
visual inspecticm. 

We now procure the original digital image from our secured place along with the 
32-bit identification word and the 32 individual embedded images. We place the original digital 
image onto our computer screen using standard image manipulation software, and we roughly cut 
along the same borders as our masked area of the suspect image, masking this image at the same 
time in roughly the same mamier. ITie word 'roughly' is used since an exact cutting is not 
needed, it merely aids die identification statistics to get it reasonably close. 

Next we rescale the masked suspect image to roughly match the size of our 
masked original digital image, that is. we digitaUy scale up or down the suspect image and 
roughly overlay it on the original image. Once we have perfomied this rough registration, we 
then throw the two images into an automated scaling and registration progi^. The program 
performs a search on the three parameters of x position, y position, and spatial scale, with die 
figure of merit being the mean squared errw between the two unages given any given scale 
variable and X and yofifeet. This is a fiiiriy standard image prwessmg methodology. Typically 
diis would be done using generally smoodi interpolation techniques and done to sub-pixel 
accuracy. The search method can be one of many, where the sunplex mediod is a typical one. 

Once the optimal scaling and x-y position variables are found, next comes 
another search on optimizing the black level, brightness gain, and gamma of die two images. 
Again, die figure of merit to be used is mean squared error, and again die simplex or odier search 
metiiodologies can be used to optimize die ihree variables. After tiiese tiiree variables are 
optimized, we apply dieir corrections to die suspect image and align it to exactly die pixel spacing 
and masking of die original digital image and its mask. We can now call diis die standard mask. 
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TTic next step is to subtract the original digital image from the newly nonnalized 
suspect image only within the standard mask region. TTiis new image is called the difference 

image. 

TTien we step through all 32 individual random embedded images, doing a local 
5 cross-coirelation between the masked difference image and the masked individual embedded 

image. 'Local' refers to the idea that one need only start coirclating over- an offset region of +/- 1 
pixels of offset between the nominal registration points of the two images found during the search 
procedures above. TTie peak correlation should be very close to the nominal registration point of 
0,0 ofife^ and we can add the 3 by 3 conelation values togetha- to give one grand correlation 
10 value for each of the 32 individual bits of our 32-bit identification word. 

After doing this for all 32 bit places and their corresponding random images, we 
have a quasi-floating point sequence of 32 values. The fu^ four values represent our caUbration 
signal of 0101. We now take the mean of the first and third floating point value and call this 
floating point value '0/ and we take the mean of the second and the fourth value and caU this 
15 floating point value '1.' We then step through all remaming 28 bit values and assign either a '0' 
or a T based simply on which mean value they are closar to. Stated sunply, if the suspect image 
is mdeed a copy of our original, the embedded 32-bit resulting code should match that of our 
records, and if it is not a copy, we should get general randomness. TTie third and the fourth 
possibilities of 3) Is a copy but doesn't match identification number and 4) isn't a copy but does 
20 match are, m the case of 3), possible if the signal to noise ratio of the process has plummeted, i.e. 
die 'suspect image' is truly a very poor copy of the original, and in the case of 4) is basically one 
chance in four bilUon smce we were using a 32-bit identification number. If we are truly worried 
about 4), we can just have a second independent lab perform their own tests on a different issue of 
the same magazine. Finally, checking the eiror-check bits against what the values give is one final 
25 and possibly overkill check on the whole process. In situations wh«e signal to noise is a possftle 
problem, these error checking bits might be elimmated without too much hann. 
Benefits 

Now that a full desaiption of the first embodiment has been described via a 
detailed example, it is appropriate to point out the rationale of some of the process steps and theu- 
30 benefits. 

The ultimate benefits of the foregoing process are that obtaining an identification 
number is fully independent of the manners and methods of preparing the difference image. That 
is to say, the manners of preparing the difference image, such as cutting, registering, scaling, 
etcetera, cannot increase the odds of finding an identification number when none exists; it only 
35 helps the signal-to-noise ratio of the identification process when a true identification number is 
present. Methods of preparing images for identification can be different from each other even, 
providing the possibility for multiple independent methodologies for making a match. 
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Hie ability to obtain a match even on sub-sets of the original signal or image is 
a key point in today's information-rich world. Cutting and pasting both images and sound clips is 
becoming more common, allowing such an embodiment to be used in detecting a copy even when 
original material has been thus corrupted. Finally, the signal to noise ratio of matching should 
begin to become difficult only when the copy material itself has been significantly altered either 
by noise or by significant distortion; both of diese also will affect that copy's commercial value, 
so that trying to thwart the system can only be done at the expense of a huge decrease in 
commopcial value. 

An early conception of this invention was the case where only a single "snowy 
image" or random signal was added to an original image, i.e. the case where N=l. "Decoding" 
this signal would mvolve a subsequent mathematical analysis using (generally statistical) 
algorithms to make a judgment on the presence or absence of this signal. The reason this 
approach was abandoned as the prefeired embodiment was that there was an inherent gray area in 
the certainty of detecting the presence or absence of the signal. By moving onward to a multitude 
15 of bit planes, i.e. N > 1, combined with simple pre-defined algorithms presaibing the manner of 
choosing between a "0" and a "1", the mvention moved the certainty question from the reabn of 
ejqjcrt statistical analysis into the reahn of guessing a random binary event such as a com flip. 
Hiis is seen as a powerful feature relative td the intuitive acceptance of this invention m both the 
courtroom and the maxketplace. The analogy whidi summarizes the inventor's thoughts on this 
20 whole question is as follows: Tbe search for a single identification signal amounts to callmg a 
coin flip only once, and relying on arcane experts to make the call; whereas the N>1 prefeired 
embodiment of this invention relies on the broadly intuitive principle of conectly calling a com 
flip N times in a row. This situation is greatly exacert)ated, i.e. the problems of "interpretation" of 
the presence of a single signal, when images and sound clips get smaller and smaller in extent. 

Another important reason that the N>1 case is the preferred embodiment over the 
N=l embodiment is that in the N=l case, the manner in which a suspect image is prepared and 
manipulated has a dh-ect bearing on the likelihood of making a positive identification. Thus, the 
manna- with which an expert makes an identification detennmation becomes an integral part of 
that determination. The existence of a multitude of mathematical and statistical approaches to 
30 making this determination leave open the possibility that some tests might make positive 

identifications while others might make negative detennmations, invitmg further arcane debate 
about the relative merits of the various identification approaches. The N>1 preferred mbodunent 
of this invention avoids this further gray area by presenting a method where no amount of pre- 
processmg of a signal - other than pre-processing which surreptitiously uses knowledge of the 
35 private code signals - can increase the likelihood of "calling the coin flip N times in a row." 

The fullest expression of the present system will come when it becomes an 
industry standard and numerous independent groups set up with their own means or 'in-house' 
brand of applying embedded identification numbers and in their decipherment. Numerous 
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indcpendent group identificatiim ^ fliftber enhance the ulto 
thereby raihancing its i^jpeal as an industry standard. 

«f Tn.,. Pnlnritv in Ct "^ ? the Composite Embedded Code Signal 

Hie foregoing discussion made use of the 0 and 1 formalism of binary 
5 technology to accomplish its ends. Specifically, the O's and I's of the N-bit identification word 
directly multiplied their corresponding individual embedded code signal to form the composite 
embedded code signal (step 8. figure 2). Tte approach certainly has its conceptual simplicity, but 
the multiplication of an embedded code signal by 0 along with the storage of that embedded code 

contains a kind of inefficiency. 
,Q It is prefiared to rnaintain the formalism of the 0 and! nature of the N-b^^ 

identification word, but to have the O's of the word induce a subtraction of their corresponding 
embedded code signal. Thus, in step 8 of figure 2. rather than only 'adding' the individual 
embedded code signak which correspond to a '1' in the N-bit identification word, we will also 
'subtract' the individual embedded code signals which correspond to a '0' in the N-bit 

15 identification word. 

At first glance this seems to add more apparent noise to flie final composite 
signal. But it also increases the energy-wise separation of the O's from the I's. and thus the 'gain' 
which is appUed in step 10. figure 2 can be conespondingly lower. 

We can refer to fliis hnprovement as tfie use of true polarity. The man 
20 advantage of ftis improvement can largely be summarized as 'informational efficiency.' 
»P«rr«.ti.a1 Orthof'«"''H*v' "f the TnHividnal Embedded Code Signals 

TTie foregoing discussion contemplates the use of generally random noise-like 
signals as the individual embedded code signals. TWs is perhaps the simplest form of signal to 
generate. However, there is a form of informational optimization which can be applied to the set 
25 of the individual embedded signals, which the applicant describes under the rubric 'perceptual 
orthogonaUty.' This term is loosely based on the mathematical concept of the orthogonality of 
vectors, with the current additional requirement that this orthogonality should maximize the signal 
energy of the identification information whUe maintaming it below some perceptibUity threshold. 
Put another way, &e embedded code signals need not necessarily be random in nature. 
30 tw «nA inunnvement r Vn^ Fmhodimn.t in the Field of Em.lsion-Based Photography 

The foregoing discussion outlined techniques that are applicable to photograpWc 
materials. The following section explores the details of this area further and discloses certain 
improvements which lend themselves to a broad range of applications. 

The first area to be discussed involves the pre-application or pre-exposing of a 
35 serial number onto traditional photographic Foducts, such as negative fihn. print paper. 

transparencies, etc. In general, this is a way to embed a priori unique serial numbers (and by 
implication, ownership and tracking information) into photographic material. The serial numbers 
themselves would be a permanent part of the normally exposed picture, as opposed to being 
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relegated to the margins or stamped on the back of a printed photograph, which all require 
separate locations and separate methods of copying. TTie 'serial number' as it is called here is 
generaUy synonymous with the N-bit identification word, only now we are using a moie common 
industrial terminology. 

In Figure 2, step 11, the disclosure calls for the storage of the "original [imagej" 
along with code images. Tien in figure 3. step 9, it directs that the original be subtracted from 
the suspect hnage. thereby leaving the possible identification codes phis whatever noise and 
comiption has accumulated. Therefore, the previous disclosure made the tadt assumption that 
there exists an original without the composite embedded signals. 

Now in the case of selling print paper and other duplication fihn products, this 
will still be the case. i.e.. an "original" without the embedded codes wiU indeed exist and the basic 
methodology of the first embodhnent can be employed. Hie origmal fihn serves perfertly well as 
an 'unencoded origmal.' 

However, in the case where Fe-exposed negative fihn is used, the composite 
embedded signal pie^ on the original fihn and thus there will never be an "original" separate 
from the pie^bedded signal. It is this latter case, therefore, which wiU be examined a bit more 
closely, along with observations on how to best use the principles discussed above (the former 
cases adhoing to the previously oudmed methods). 

The clearest point of departure for ihe case of pre-numbered negative fihn. i.e 
negative fihn which has had each and every frame pitted with a very fidnt ami unique 
composite embedded signal, comes at step 9 of figure 3 as previously noted. THere are ccrtamly 
other differences as well, but they are mostly logistical in natme, such as how and when to embed 
the signals on the fihn, how to store the code numbers and serial nmnber, etc. Obviously the pre- 
exposmg of fihn would involve a major change to the general mass production process of creatmg 
and padcaging fihn. 

Fig. 4 has a schematic outlmmg one potential post-hoc mechanism for pre- 
exposing fihn. 'Post-hoc' refers to applymg a process after the fiiD common manufacturing 
processoffihnhasalreaclytakenptece. Eventually, economies of scale may dictate plactog this 
Fe-exposing process directly mto the cfaam of mamifecturing fihn. Depicted in Fig. 4 is what is 
commonly known as a fihn writing system. Jbt computer. 106. dispteys the composite sigmd 
produced in step 8, figure 2, on its phosphor screen. A given frame of fihn is then exposed by 
imaging this phosphor screen, where the exposure level is generally very feint, i.e. generally 
imperceptible. Clearly, the marketplace will set its own demands on how faint this should be, that 
is, the level of added 'grammess' as practitioners would put it. Each frame of fihn is sequentially 
exposed, where m general the composite image displayed on the CRT 102 is changed for each and 
every frame, thereby givmg each frame of fihn a different serial number. lUe transfer lens 104 
highlights the focal conjugate planes of a fihn frame and die CRT fiice. 
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Getting back to the q)plymg the principles of the foregoing embodiment in the 
case of pre^xposed negative film... At step 9, figure 3. if we were to subtract the "original" with 
its embedded code, we would obviously be "erasing" the code as well since the code is an integral 
part of the origmal. Fortunately, remedies do exist and identifications can stiU be made. 
5 However, it will be a challenge to artisans who refine this embodiment to have the signal to noise 
ratio of the identification process in the pre-exposed negative case approach the signal to noise 
ratio of the case where Ae un-mcoded original exists. 

A sucdnct definition of the problem is to order at fliis pomt Givm a suspect 
picture (signal), find the embedded identification code IF a code exists at al. TTie problem reduces 
10 to one of findtog the ampUtude of each and every individual embedded code signal within the 
suspect picture, not only wiliun the context of noise and corruption as was previously explained, 
but now also withm the context of the coupling between a captured image and the codes. 
•Couplmg' here refers to the idea that the captured image "randomly biases" the cross-correlation. 

So, bearing in mind this additional item of signal coupling, the identification 
15 process now estimates the signal ampUtude of each and every individual embedded code signal (as 
opposed to taking the cross-correlation result of step 12, figure 3). If our identification signal 
exists m the suspect picture, the amplitudes thus found will split mto a polarity with positive 
amplitudes being assigned a '1' and negative amplitudes bemg assigned a '0'. Our unique 
identification code manifests itself. It on the other hand, no such identification code exists or it is 
20 someone else's code, then a random ganssian-like distribution of amplitudes is found with a 

random hash of values. 

It remains to provide a few more details on how the amplitudes of the individual 
embedded codes are found. Again, fortmiately, this exact problem has been treated in other 
technological applications. Besides, throw this problem and a little food into a crowded room of 
25 mathematicians and statisticians and surely a half dozen optimized methodologies will pop out 
after some reasonable period of time. It is a rather cleanly defined problem. 

One specific example solution comes fimm the field of astronomical imaging. 
Here, it is a mature prior art to subtract out a "themal noise frame" from a given CCD image of 
an object. Oiten, however, it is not precisely known what scaling factor to use in subtracting the 
30 thermal frame, and a seareh for the correct scaltog fector is performed. This is precisely the task 
of Ais step of the present embodiment 

General practice merely performs a common search algorithm on the scaling 
fector, where a scaling factor is chosen and a new image is areated according to: 

NEW IMAGE = ACQUIRED IMAGE - SCALE * THERMAL IMAGE (4) 
35 11,6 new image is applied to the festfourier transform routme and a scale fector 

is eventually fomid which mtoimizes the integrated high frequency content of the new image. 
This general type of search operation with its minhnization of a particular quantity is exceedingly 
common. n,e scale fector thus found is tiie sought-for "ampUtode." Refinements wUch are 
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comemplated but not yet implemented ate where the coupling of the higher derivatives of the 
acquired image and the embedded codes are estimated and removed from the calculated scale 
fector. In other words, certain bias efifects from the coupling mentioned earlier are present and 
should be eventually accounted for and removed both through theoretical and empirical 
e}q>erimentati(»i. 

Use and Improvemaitg in the Detection of Signal or Iina f>f> Al»P»ri»« . 

Apait from the basic need of identifying a signal or image as a whole, there is 
also a rather ubiquitous need to detea possible alterations to a signal or image. The following 
section descn-bes how the foregoing embodiment, with certain modifications and improvements, 
can be used as a powerful tool in this area. Ihe potential scemuios and applications of deteetiiig 
altoations are innumerable. 

To first smmnarize, assmne that we have a given signal or image which has been 
positively identified using the basic methods outlined above. In other words, we know its N-bit 

identification word, its individual embedded code signals, and its composite embedded code. We 
can then fairly simply create a spatial map of the composite code's amplimde within our given 
signal or image. Fmthennoie. we can divide this amplitude map by the known composite code's 
spatial amplitude, giving a normalized map. i.e. a map which should flurtuate about some global 
mean value. By sunple examhiation of this map, we can visuaUy detect any areas which have 
been significantly altered wherein the vahie of the normalized ampUtude dips below some 
statistically set threshold based purely on typical noise and corruption (enor). 

The details of unplementing the creation of the amplitude map have a variety of 
choices. One is to perform the same procedme which is used to determme the signal ampUtude as 
described above, only now we step and repeat the multiplication of any given area of the 
signal/image with a gaussian weight function centered about the area we ai« mvestigating. 
Universal Versus Custom C^k 

The disclosure thus far has outlined how eadi and every source signal has its 
own unique set of individual embedded code signals. This entails the storage of a significant 
amount of additional code information above and beyond the orighud. and many applications may 
merit some form of economizmg. 

One such approach to economizmg is to have a given set of mdividual embedded 
code signals be common to a batch of source materials. For example, one thousand images can aU 
utilize the same basic set of individual embedded code signals. The storage requirements of these 
codes then become a small fraction of the overall storage requirements of the source material. 

Furdiermore, some applications can utilize a universal set of individual embedded 
code signals, it., codes which remam the same for all instances of distributed material. This type 
of requirement would be seen by systems which wish to hide the N-bit identification word itself, 
yet have standardized equipment be able to read that word. This can be used m systems which 
make go/no go decisions at pomt-of-read locations. The potential drawback to this set-up is that 
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the univeisal codes are more prone to be sleuthed or stolen; therefore they wiU not be as secure as 
the apparatus and methodology of the previously disclosed arrangement. Perhaps this is just the 
difference between 'high security' and 'ah-tight security.' a distinction carrying Utde weight with 
the bulk of potential applications. 
5 TkB in Printing. Pa f^^, nncm^nts. P l astic Coated Identification Cards, and Other Material Where 

Olobal Embedded Codes Ca n Be Imprinted 

The term 'signal' is often used narrowly to refiar to digital data information, 

audio signals, images, etc. A broader interpretation of 'signal,' and the one more generally 
intended, mcludes any form of modulation of any material whatsoever. Thus, the micro-topology 
10 of a piece of common paper becomes a 'signal' (e.g. it height as a function of x-y coordinates). 
TTie reflective properties of a flat piece of plastic (as a fimction of space also) becomes a signal. 
Tht point is that photographic emulsions, audio signals, and digitized information are not the only 
types of signals enable of utilizing the principles of tiie present invention. 

As a case in point, a machine very much resembling a braille printing machine 
can be designed so as to imprint unique 'noise-like' mdentations as outlined above. Tbese 
indentations can be applied with a pressme which is much smaller than is typically applied in 
creating braiUe, to the point whens the patterns are not noticed by a normal user of the paper. But 
by foUowing tiie steps of the present disclosure and applymg them via the mechanism of micro- 
mdentations, a unique identification code can be placed onto any given sheet of paper, be it 
20 irtended for everyday stationary pmposes, or be it for important documents, legal tender, 0^ 

secured materiaL 

Tbe reading of the identification material in such an embodunent gen»ally 
proceeds by merely reading the document optically at a variety of angles. Ibis would become an 
inexpensive method for deducing tiie micro-topology of the paper surfi>ce. Certainly other forms 
25 of reading tiie topology of the p^er are possible as weU. 

In tiie case of plastic encased material such as identificati<m cards. e.g. driver's 
Bcenses. a shnilar braUle-like impressions machine can be utilized to imprint unique identification 
codes. Subfle layers of photoreactive materials can also be embedded inside the plastic and 
'exposed' 

It is clear that wherever a material exists which is capable of being modulated by 

'noise-like- signals, tiiat material is an appropriate carrier for unique identification codes and 

utilization of die principles of the invention. All tiiat remains is die matter of economically 

applying the identification information and maintaining tiie signal level below an acceptabflity 

threshold which each and every application wfll define for itself 

35 /A ppendix A Description 

Appendix A contains fte source code of an implementation and verification of 

tire foregoing embodiment for an 8-bit black and white unaging system. 
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REAL TTME FumnPR 

While the fiist class of embodiments most commotdy employs a standard 
microprocessor or computer to perfonn the encodation of an image or signal, it is possible to 
utilize a custom encodation device which may be fjister than a typical Von Neuman-lype 
processor. Such a system can be utilized with all manner of serial data streams. 

Music and videotape recordings are examples of serial data streams - data 
streams which are often pirated. It would assist enforcement cffoite if authorized recordings were 
encoded with identification data so that pirated knock-offe could be traced to the original from 
which tfa^ were made. 

Piracy is but one concern driving the need for the present invention. Another is 
authentication. Often it is important to confirm that a given set of data is really what it is 
purported to be (often several years after its generation). 

To address these and other needs, the system 200 of Fig. 5 caii be employed. 
System 200 can be thought of as an identification coding black box 202. The system 200 receives 
an input signal (sometimes termed the "master" or "unencoded" signal) and a code word, and 
produces (generally in real time) an identification^oded output signal. (UsuaUy, the system 
provides key data for use in lata- decoding.) 

The contents of the "black box" 202 can take various forms. An exemplaiy 
black box system is shown in Fig. 6 and includes a look-up table 204, a digital noise somce 206. 
first and second scalers 208, 210, an adder/subtracter 212, a memoiy 214. and a register 216, 

TTie input signal (which in the ilhistrated embodiment is an 8 - 20 bit data signal 
provided at a rate of one million samples per second, but which in other embodiments could be an 
analog signal if appropriate AID and D/A conversion is provided) is applied fiwn an input 218 to 
the address input 220 of the look-up table 204. For each input sample (i.e. look-up table address), 
the table provides a corresponding 8-bit digital output word. This output word is used as a scaling 
faaor that is qjpUed to one mput of the first scaler 208. 

The first scaler 208 has a second input, to wUch is applied an 8-bit digital noise 
sigmd fiom source 206. (In the illustrated embodiment, the noise souree 206 comprises an analog 
noise souree 222 and an amdog-to-digital converter 224 although, again, other implementations can 
be used.) The noise somt» in the illustrated embodunent has a zero mean output value, with a 
fiill width half maximum (FWHM) of 50 - 100 digital numbers (e.g. fiom -75 to +75). ' 

nie fust scaler 208 multiplies the two 8-bit words at its inputs (scale fector and 
noise) to produce - for each sample of the system input signal - a 16-bit output word. Since the 
noise signal has a zero mean value, the output of the first scaler likewise has a zero mean value. 

The output of the first scaler 208 is applied to the input of the second scaler 210. 
•me second scaler serves a global scaling function, establishing the absolute magnitude of the 
identification signal that will ultimately be embedded mto the input data signal. lUe scaling factor 
is set through a scale control device 226 (which may take a number of forms, from a simple 
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rheostat to a graphically implemented control in a graphical user interface), permitting this factor 

to be changed in accordance with the requirements of different applications. The second scaler 

210 provides on its output line 228 a scaled noise signal. Each sample of this scaled noise signal 

is successively stored in the memory 214. 
5 (In the illustrated embodiment, the output from the first scaler 208 may range 

between -1500 and +1500 (decunal), while the output from the second scsler 210 is m the low 

single digits, (such as between -2 and +2).) 

Register 216 stores a multi-bit identification code word. In &e illustrated 

embodiment this code word consists of 8 bits, altiiough larger code words (up to hundreds of bits) 
10 are commonly used. Ihese bits are referenced, one at a time, to control how the input signal is 

modulated with tiie scaled noise signal. 

In particular, a pomter 230 is cycled sequentially through the bit positions of the 

code word in register 216 to provide a control bit of "0" or "1" to a control input 232 of the 

adder/subtracter 212. If, for a particular input signal sample, the control bit is a "1", the scaled 
15 noise signal sample on line 232 is added to the input signal sample. If flie control bit is a "0", the 

scaled noise signal sample is subtracted from the input signal sample. Ihe output 234 from the 

adder/subtracter 212 provides the blade box's ou^t signal. 

Ihe addition or subtraction of the scaled noise signal in accordance with the bits 

of the code word effects a modulation of the input signal ihat is generally impesreptible. 
20 However, with knowledge of the contents of the memory 214, a user can later decode the 

encoding, detennining the code number used in the original encoding process. (Actually, use of 

memory 214 is optional, as explained below.) 

It will be recognized that the encoded signal can be distributed in well known 

ways, including converted to printed image form, stored on magnetic media (floppy diskette, 
25. analog or DAT tape, etc.), CD-ROM, etc. ^. 

Decoding 

A variety of techniques can be used to determine the identification code with 
which a suspect signal has been encoded. Two are discussed below. The first is less preferable 
than the latter for most applications, but is discussed herein so that the reader m^ have a fuller 
30 context wifhm which to understand the invention. 

More particularly, the first decoding method is a difference method, relying on 
subtraction of corresponding samples of the original signal from the suspect signal to obtain 
difference samples, which are then examined (typically individually) for deterministic coding 
indicia (i.e. the stored noise data). This approach may thus be termed a "sample-based, 
35 deterministic*' decoding technique. 

The second decoding method does not make use of the original signal. Nor does 
it examine particular samples looking for predetermmed noise characteristics. Rather,* the statistics 
of the suspect signal (or a portion thereof) are considered in the aggregate and analyzed to discern 
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the presence of identification coding that permeates the entire signal. The reference to permeation 
means the entire identification code can be discerned fi-om a small fi^ent of the suspect signal. 
This latter approach may thus be termed a "hologr^hic, statistical" decoding technique. 

Both of these methods begin by registering the suspect signal to match the 
5 original. Ihis oitails scaling (e.g. in amplitude, duration, color balance, etc.), and sampling (or 
resampling) to restore the original sample rate. As m the earlier described embodiment, there are 
a variety of well understood techniques by which the operations associated wi& this registration 
function can be performed. 

As noted, the first decoding approach proceeds by subtracting the original signal 
10 fi-om the registered, suspect signal, leaving a difference signal. The polarity of successive 

difference signal samples can then be compared with the polarities of the corresponding stored 
noise signal samples to determine the identification code. TTiat is, if the polarity of the first 
difference signal sample matches that of the first noise signal sample, then the first bit of the 
identification code is a "1." (In such case, the polarity of the 9th, 17th, 25th, etc. samples should 
15 also all be positive.) If the polarity of the first difference signal sample is opposite that of the 
corresponding noise signal sample, then the first bit of the identification code is a "0." 

By conducting the foregoing analysis with eight successive samples of the 
difference signal, the sequence of bits that comprise the original code word can be determined. If, 
as in tiie preferred embodunent pointer 230 stq>ped througji the code word one bit at a time, 
20 beginning with the first bit, during encoding, then the first 8 samples of the differaice signal can 
be analyzed to uniquely determine the value of the 8-bit code word. 

In a noise-free worid (speakmg here of noise independent of that with which the 
identification coding is effected), the foregoing analysis would always yield the correct 
identification code. But a process that is only ^plicable in a noise-fi-ee worid is of limited utility 
25 indeed. 

(Further, accurate identification of signals in noise-free contexts can be handled 
in a variety of other, simpler ways: e.g. checksums; statistically improbable conespondence 
between suspect and original signals; etc.) 

While noise-induced aberrations in decoding can be dealt with - to some degree 
30 - by analyzing large portions of the signal, such aberrations still place a practical ceiling on the 
confidence of the process. Further, the villain that must be confronted is not always as boiign as 
random noise. Rather, it increasingly takes the form of human-caused corruption, distortion, 
manipulation, etc. In such cases, the desired degree of identification confidence can only be 
achieved by other approaches. 

presently preferred approach (the "holographic, statistical" decoding 
technique) relies on recombining the suspect signal with certain noise data (typically the data 
stored in memory 214), and analyzing the entropy of the resulting signal. "Entropy" need not be 
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understood in its most strict mathematical definition, it being merely the most concise word to 
describe randomness (noise, smoothness, snowiness, etc.). 

Most serial data signals are not random. That is, one sample usually correlates - 
to some degree - with the adjacent samples. Noise, in contrast, typicaUy is random. If a random 
signal {e.g. noise) is added to (or subtracted from) a non-random signal, the entropy of the 
resulting signal generally increases. That is, the resulting signal has more random variations than 
the origmal signal. This is the case with the aicoded outpvt signal produced by the present 
encoding process; it has more entropy than the original, unencoded signal. 

It in contrast, the addition of a random signal to (or subtracticm from) a non- 
random signal r^uaw entropy, Aen somethmg unusual is h^pen^ It is this anomaly that the 
preferred decodmg process uses to detect anbedded identification coding. 

To fiiUy understand this entropy-based decoding method, it is first helpfiil to 
highlight a charactmstic of the original encoding process: the similar treatment of every eighth 
sample. 

15 In the encoding process discussed above, the pointer 230 mcrements through the 

code word, one bit for each successive sample of the input signal. If the code word is eight bits 
in length, then the pomter returns to the same bit position m the code word evoy eighth signal 
sample. If this bit is a "r, noise b added to the mput signal; if this bit is a "0", noise is 
subtracted from the input signal. Due to the cyclic progression of the pomter 230, eveiy eighA 

20 sample of an encoded signal thus shares a diaracteristic: tiiey are aU either augmented by the 
correspondmg noise data (which may be negative), or they are all diminished, depending on 
whether the bit of the code word then being addressed by pointer 230 is a T or a "0". 

To exploit this characteristic, the entropy-based decoding process treats every 
eighth sample of the suspect signal in like feshion. In particular, the process begins by adding to 

25 the 1st, 9th, 17th, 25th, etc. samples of the suspect signal the corresponding scaled noise signal 
values stored in the memory 214 (i.e. those stored in the 1st, 9th, 171h, 25th, etc., memory 
locations, respectively). Tlie entropy of the resulting signal (i.e. the suspect signal with ev«y 8th 
sample modified) is then computed. 

(Computation of a signal's entropy or randomness is well understood by artisans 

30 in this field. One generally accepted technique is to take the derivative of the signal at each 
sample point, square these values, and then sum over the entire signal. However, a variety of 
other well known techniques can alternatively be used.) 

The foregoing step is then repeated, this time subtracting the stored noise values 
from the 1st, 9th, 17th, 25 etc. suspect signal samples. 

35 One of these two operations will undo the encoding process and reduce the 

resuking signal's entropy; the other will aggravate it. If adding the noise data in memory 214 to 
the suspect signal reduces its entropy, then this data must earlier have been subtracted from the 
original signal. Hiis indicates that pointer 230 was pointing to a "0" bit when these samples were 
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encoded. (A "0" at the control input of adder/subtiacter 212 caused it to subtract the scaled noise 
from the input signal.) 

Conversely, if subtracting the noise data from every eighth sample of the suspect 
signal reduces its entropy, then the encoding process must have earlier added this noise. This 
indicates that pointer 230 was pointing to a "l" bit when samples 1, 9, 17, 25. etc.. were encoded. 

By noting whether entropy deoeases by (a) adding or (b) subtracting the stored 
noise data to/from the suspect signal, it can be deteimined that the first bit of the code word is (a) 
a "0", or (b) a "1". 

The foregoing operati<His are then conducted for the group of spaced samples of 
the suspect signal begmning with the second sample (i.e. 2, 10, 18, 26 ...). The entropy of the 
resulting signals indicate whether the second bit of flie code word is a "0" or a "1". Likewise wift 
the following 6 groqjs of spaced samples in the su^ signal, until aU 8 bits of the code word 
have heea discerned. 

It will be appreciated that the foregoing approach is not sensitive to comiption 
mechanisms that alter the values of individual samples; instead, the process considers the entropy 
of the signal as a whole, yieldmg a high degree of confidence in the results. Further, even small 
cxceipts of the signal can be analyzed in this manner, pennitting piracy of even small details of an 

original work to be detected. Hie results are thus statistically robust, both in the fece of natural 
and human conuptian of the suspea signaL 

It will further be appreciated that the use of an N-bit code word in this real time 
embodiment provides benefits analogous to those discussed above in connection with the batch 
encoding system, (todeed, the present embodhnent may be conceptualized as making use of N 
different noise signals, just as in the batch encoding system. The first noise signal is a signal 
having the same extent as the input signal, and comprising the scaled noise signal at the 1st. 9th, 
17th. 25th, etc.. samples (assuming N=8), with zeroes at the intervening samples. The second 
noise signal is a similar one comprising the scaled noise signal at the 2d. 10th. 18th, 26th. etc.. 
samples, with zeroes at the intervening samples. Etc. These sigmils are all combined to provide a 
composite noise signal.) One of the important advantages mherent in sudi a system is fte hi^ 
degree of statistical confidence (confidence v/bich doubles with each successive bit of flie 
identification code) that a match is really a match. The system does not rely on subjective 
evaluation of a suspect signal for a smgle. deterministic embedded code signal. 
Illustrative Variations 

From the foregoing description, it will be recognized that numerous 
modifications can be made to the illustrated systems without changing the fimdamental prmciples. 
35 A few of these variations are described below. 

The above-described decoduig process nies both adding and subtracting stored 
noise data to/fiom the suspect signal in order to find which operation reduces entropy, hi other 
embodhnents, only one of these operatiomi needs to be conducted. For example, in one alternative 
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decoding process the stored noise data corresponding to evay eightli sample of the suspect signal 
is only add^ to said samples. If the entropy of the resulting signal is thereby increased, then the 
corresponding bit of the code word is a "l" (i.e. this noise was added earlier, during the encoding 
process, so adding it again only compounds the signal's randomness). If the entropy of the 
resulting signal is thereby decreased, then the corresponding bit of the code word is a "0". A 
forthw test of entropy if the stored noise samples are subtr^e4 is not required. 

Tbe statistical idiabiUty of fte identification i»ocess (coding and decoding) can 
be designed to exceed virtually any confidence threshold (e.g. 99S%, 99.99%, 99.999%, etc. 
confidence) by appropriate selection of the global scaling fectors, etc. Additional confidence in 
any given application (unnecessary m most applications) can be achieved by recheddng the 
decoding process. 

One way to rediedc the decoding process is to remove the stored noise data from 
the suspect signal in accordance with the bits of the discerned code word, yielding a "restored" 
signal (e-g. if the first bit of the code word is found to be "1," then tiie noise samples stored in the 
1st, 9th, 17th, etc. locations of the memory 214 are subtracted firtmi Ae cwresponding samples of 
the suspect signal). Tlie entropy of the restored signal is measured and used as a baseline in 
further measurements. Next, ti» process is repeated, this time removing tiie stored noise data 
from the suspect signal in accordance with a modified code word. The modified code word is the 
same as the discerned code word, except I bit is toggled (e.g. tiie first). Hie entropy of the 
20 resulting signal is detennine4 and compared with tije baseline. If the toggling of flie bit in the 
discerned code word resulted in mcreased entropy, then the accuracy of that bit of the discerned 
code word is confirmed. The process repeats, each time with a different bit of tiie discemed code 
word toggled, until all bits of the code word have been so checked. Each change should result in 

an increase in entropy compared to the baseline value. 
25 The data stored in memory 214 is subject to a variety of alternatives. In tiie 

foregoing discussion, memoiy 214 contains tiie scaled noise data. In otiier embodiments, flie 
unsealed noise data can be stored instead. 

In stiU oflier embodimmts, it can be desirable to store at least part of tiie input 
signal itself in memory 214. For example, tiie memory can allocate 8 signed bits to tiie noise 
30 sample, and 16 bits to store tiic most significant bits of an 18- or 20-bit audio signal sample. TTiis 
has several benefits. One is tiiat it simplifies registration of a "suspect" signal. Anotiier is tiiat, in 
tiie case of encoding an input signal which was already encoded, tiie data in memory 214 can be 
used to discern which of the encoding processes was performed first. That is, from tiie input 
signal data in memory 214 (albeit incomplete), it is generaUy possible to detennme witii which of 
35 two code words it has been encoded. 

Yet anotiier altonative for memory 214 is fliat is can be omitted altogetiiw. 
One w^ tills can be achieved is to use a deterministic noise source in flie 
encoding process, such as an algoritimiic noise generator seeded witii a known key nmnber. The 
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same detenninistic noise source, seeded with the same key mmiber. can be used in the decoding 
process. In such an arrangement, only the key nmnber needs be stored for later use in decodmg. 
instead of the large data set usually stored in memoiy 214. 

Alternatively, if the noise signal added during encoding does not have a zero 
mean value, and the length N of the code word is known to the decoder, then a universal decoding 
process can be implemented. This process uses the same entropy test as the foregoing procedures, 
but cycles through possible code words, adding/subtracting a small dmmny noise value (e.g. less ' 
than the expected mean noise vahie) to every Nth sample of the suspect signal, in accordance with 
the bits of the code word bemg tested, until a reduction in entropy is noted. Such an approach is 
not fiivored for most applications, however, because it ofifeis less security than the other 
emboduaats (e.g. it is subject to craddng by teute force). 

Many qjpUcations are well swved by the embodiment ilhistrated in Fig. 7. in 
which different code words are used to produce several differently encoded versions of an Lput 
signal, each making use of die same noise data. More particularly, the embodmient 240 of Fig. 7 
includes a noise store 242 into which noise from source 206 is written during the identification- 
coding of the mput signal with a first code word. (The noise souroe of Fig. 7 is shown outside of 
the real time encoder 202 for convenience of illustiation.) Thereafter, additional identification- 
coded versions of Ae input signal can be produced by read«g the stored noise data from the stoic 
and using it in conjunction widi second dirough Ntii code woids to encode the sigmd. (While 
bmaty-sequential code words are iHustnited in Fig. 7. in odier embodhnents arbitnny sequences of 
code words can be employed.) With such an ammgement. a great nmnber of differenfly-encoded 
signals can be produced, without requiring a proportionally-sized long tenn noise memoiy. 
Instead, a fnted amount of noise data is stored, whether encoding an original once or a thousand 



times. 



(If desired, several differently-coded output signals can be produced at the same 
thne, rather than seriatim. One such implementation inchides a plurality of adder/subtracter 
circuits 212. each driven with d>e same input signal and with the same scaled noise signal, but 
with different code words. Each, dien, produces a differentiy encoded output sigmd.) 

In applications havmg a great number of difFerentiy-encoded versions of the 
same origimd, it will be recognized that the decoding process need not always discern every bit of 
the code word. Sometm.es. for example, the application may require identifying only a group of 
codes to which the suspect signal belongs. (E.g., high order bits of the code word might mdicate 
an organization to which several differently coded versions of die same source material were 
provided, with low-order bits identifying specific copies. To identify the organization widi which 
a suspect signal is associated, it may not be necessary to examine the low order bits, since the 
organization can be identified by the high order bits alone.) If the identification requirements can 
be met by discemmg a subset of die code word bits in die suspect signal, die decoding process can 
be shortened. 
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Some ^plications may be best served by restarting the encoding process - 
sometimes with a different code word - several times witiiin an integral work. Consider, as an 
example, videotaped productions (e.g. television programming). Eadi frame of a yiieaOped 
production can be identification-coded with a unique code number, processed in real-time with an 
arrangement 248 like that shown in Fig. 8. Each time a vertical retrace is detected by sync 
detector 250. tiie noise source 206 resets (e.g. to repeat the sequence just produced) and an 
identification code increments to the next vahie. Each frame of the videotape is thereby uniquely 
identificatioo-coded. TVpicaUy. tiie encoded signal is stored on a videotape for long tenn storage 
(although other storage media, mcluding laser disks, can be used). 

Returning to flie aicodmg apparatus, the look-up table 204 in tiie illustrated 
embodiment exploits the fiirt tiiat high amplihide samples of the input data signal can tolerate 
(without objectionable degradation of the output signal) a higher level of encoded identification 
coding tiian can low amplitude input samples. TTius, for example, input data samples having 
decimal values of 0, 1 or 2 may be correspond (in tiie look-up table 204) to scale factors of unity 
15 (or even zero), whereas input data samples having values in excess of 200 may coirespond to scale 
factors of 15. GeneraUy speaking, tiie scale fectors and tiie input sample vahies coitespond by a 
square root relation. That is, a four-fold mcrease in a value of the sanqiled iapat signal 
. corresponds to approximately a two-fold increase in a value of tiie scaling fector associated 
therewitiL 

20 (The paioitiietical reference to zero as a scaling fertor alludes to cases, e.g., in 

whidi tiie source signal is temporaUy or spatially devoid of information content. In an image, for 
example, a region characterized by several contiguous sample values of zero may correspond to a 
jet black region of tiie frame. A scaling vatoe of zrao may be qipropriate here since tiiere is 
essentially no image data to be pirated.) 
25 Continuing wifli tiie encoding process, tiiose skilled m tiie art wiU recognized tiie 

potential for "rail enots" m tiie iUustrated embodhnent. For example, if tiie input signal consists 
of 8-bit samples, and tiie samples span tiie entire range from 0 to 255 (dedmal), tiien tiie addition 
or subtraction of scaled noise to/from tiie input signal may produce output signals tiiat cannot be 
represented by 8 bits (e.g. -2, or 257). A number of well-understood techniques exist to rectify 
tiiis situation, some of tiiem proactive and some of tiiem reactive. (Among tiiese known 
tediniques are: specifying tiiat tiie input signal shall not have samples in tiie range of 0-4 or 251- 
255, tiiereby safely permitting modulation by tiie noise signal; or including provision for detecting 
and adaptively modifying input signal samples fliat would otiierwise cause rail eirors.) 

While tiie illustrated embodimoit desaibes stepping tiirough tiie code WMd 
35 sequentially, one bit at a time, to control modulation of successive bits of flie input signal, it will 
be a^jpreciated tiiat tiie bits of tiie code word can be used oflier tiian sequentially for tiiis purpose. 
Indeed, bits of tiie code word can be selected in accordance witii any predetramined algoritimi. 
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Tbe dynamic scaling of the noise signal based <m the instantaneous vahie of the 
input signal is an optimization that can be omitted in many embodiments. Tlat is. the look^,p 
table 204 and the first scaler 208 can be omitted entirely, and the signal from the digital noise 
source 206 applied directly (or through the second, global scaler 210) to the adder/subtracter 212. 

It will be further recognized that the use of a zero-mean noise source simplifies 
the ilhistiated embodiment, but is not necessary to the invention. A noise signal with another 
mean value can r«adily be used, and D.C. compensation (if needed) can be effected elsewhere in 
the system. 

-nie use ofanoise source 206 is also optiomil. A variety of other signal somces 
can be used, dependmg on application- dependent constraints (eg. the threshold at which the 
encoded identification signal becomes perceptible). In many iiutances. the level of the embedded 
Identification signal is low enough that the identification signal needn't have a random aspect; it is 
imperceptible regardless of its nature. A pseudo random source 206. however, is usually desired 
because it provides the greatest identification code signal S/N ratio (a somewhat awkward term in 
this instance) for a level of imperceptibility of the embedded identification signal. 

It will be recognized that identification coding need not occur after a signal has 
been reduced to stored fonn as data (i.e. -fixed in tangible fenn." in the words of the U.S. 
Copyright Act). Consider, fta- example, the case of popular musicians ^ose performance are 
often recorded illicitly. By identification coding the audio before it drives concert hall speakers 

unautfaorizedrecordingsoftheconcertcanbetracedtoaparticularplaceandtime. Likewise Kve 
audio sources such as 911 emergency calls can be encoded prior to recording so as to fi«dlitate 

their later authentication. 

While the black box embodiment has been described as a stand alone unit, it wiU 
be recognized that it can be integrated into a number of different tools/instruments as a 
component One is a seamier, which can embed identification codes in the scamied output data. 
(Tie codes can simply serve to memorialize that the data was genemted by a particular scanner) 
Another is in creativity software, such as popular drawmg/graphics/animation/paint programs 
offered by Adobe, Macromedia, Corel, and the like. 

Finally, while the real-time encoder 202 has been illustrated with reference to a 
particular hardware implementation, it will be recognized that a variety of other implementations 
can alternatively be employed. Some utilize other hardware configurations. Others make use of 
software routines for some or all of the illustrated functional blocks. (He software routines can 
be executed on any number of different general pmpose programmable computers, such as 80x86 
PC-compatible computws, RISC-based workstations, etc.) 
TyPES OF NOISE OUASI-Nni SE. AND OPm^D.NOiSF 

Heretofore this disclosure postulated Gaussian noise, "white noise," and noise 
generated directly from application instrumentation as a few of the many examples of the kind of 
carrier signal appropriate to carry a single bit of information throughout an image or signal It is 
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possible to be even more proactive in "designing" characteristics of noise in order to achieve 
certain goals. Hie "design" of using Gaussian or instrumental noise was aimed somewhat toward 
"absolute" security. Hiis section of the disclosure takes a look at other considerations for the 
design of the noise signals which m^ be considered the ultimate carries of the identification 
S information. 

For some applications it might be advaitageous to design the noise carrier signal 
(e.g. the Nth embedded code signal in first embodiment; flie scaled noise data in the second 
embodiment), so as to provide more absohite signal strength to the identification sigmd relative to 
AepwcefrtibiUty of flat signal. One exarr^le is flie following. It is recognized tiwt a true 
10 Gaussian noise signal has the value '0' occur most frequently, followed by 1 and -1 at equal 
probabiUties to each other but lower than '0', 2 and -2 next, and so on. Clearly, the value zero 
carries no information as it is used in the service of this invention. TTius, one shnple adjustment, 
or design, would be that any time a zero occurs in Ae generation of the embedded code signal, a 
new process takes over, whereby the value is converted "randomly" to either a 1 or a -1. In 
15 logical terms, a decision would be made: if '0', then landom(l.-l). The Wstogram of such a 
process would appear as a Ganssian/Poissonian type distribution, except that Ae 0 bin would be 
empty and the 1 and -1 bin would be increased by half the usual histogram value of the 0 Wn. 

In this case, identiiicatiai signal en»gy would alws^rs be applied at all parts of 
the signaL A few of the trade-offe include: there is a (probably negligible) lowering of security of 
20 the codes in that a "deterinmistic component" is a part of generating the noise signal. Tbe reason 
this might be conq)letely negligible is fliat we still wind up with a coin flip type situarion on 
randomly choosing the 1 or the -1. Another trade-off is that this type of designed noise will have 
a higher threshold of pCTceptibility, and wUl only be applicable to applications where the least 
significant bit of a data stream or image is aheady negligible relative to the commercial value of 
25 the matCTial, i.e. if the least significant bit wctc stripped from the signal (for all signal samples), 
no one would know the difference and to value of the material would not suffer. This blocking 
of the zero value in flie example above is but one of many w^ to "optimize" the noise propwties 
of the signal carrier, as anyone in the art can realize. We refer to this also as "quasi-noise" in the 
sense that natural noise can be transfiwmed m a pre-determmed vray into signals which for aU 
30 intents and purposes will read as noise. Also, cryptographic methods and algorithms can easily, 
and often by definition, create signals which are poceived as completely random. TTius the word 
"noise" can have diffwent connotations, primarily between that as defined subjectively by an 
observer or listener, and that defmed mathematically. The difference of the latter is that 
mathematical noise has diff-erent properties of security and the simplicity with which it can either 
35 be "sleuthed" or the simplicity with which instruments can "automatically recognize" the existence 
of this noise. 
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"UnivCTsal" Embedded Cades 

The bulk of this disclosure teaches that for absolute security, the noise-like embedded code signals 
which cany the bits of infoimation of the identification signal should be unique to each and every 
encoded signal, or. slightly less restrictive, that embedded code signals should be generated 
sparingly, such as usmg the same embedded codes for a batch of 1000 pieces of film, for example. 
Be this as it may. there is a whole other approach to this issue wherein the use of what we will 
caU "univeisal" embedded code signals can open up laige new applications for this technology. 
The economics of these uses would be such that the de fiicto lowered security of these universal 
codes (e.g. they would be amdyzable by time honored cryptographic decoding methods, and thus 
potentially thwarted or reversed) would be economically negligible relative to the economic gains 
that the intended uses would provide. Piracy and illegitimate uses would become merely a 
predictable "cost" and a source of uncollected revenue only; a simple line item in an economic 
analysis of the whole. A good analogy of this is in the cable industry and the scrambling of video 
signals. Everybody seems to know that crafty, skilled technical individuals, who may be generally 
law abiding citizens, can climb a ladder and flip a few wires in their cable junction box in order to 
get all the pay cfaamiels for fiee. Tte cable industry knows this and takes active measures to stop 
it and prosecute those caught, but the "lost revemie" derived fiom this practice remains prevalent 
but ahnost negligible as a percentage of profits gained from the scnunbling system as a whole. 
Hie scrambling system as a whole is an economic success despite its lack of "absohite security." 

Tlie same holds true for applications of this technology wherein. fi«r the price of 
lowering security by some amount, large economic opportunity presents itself. THi, section first 
describes what is meant by universal codes, then moves on to some of the mteresting uses to 
which these codes can be applied. 

Universal embedded codes generally refer to the idea that knowledge of the exact 
codes can be distributed. The embedded codes won't be put into a dark safe never to be touched 
mita litigation arises (as alhided to in other parts of this disclosure), but instead wUl be distributed 
to various locations where on-the-spot analysis can take pbce. Genendly this distribution will 
stffl take place within a security controlled enviromnent. meaning that steps wUl be taken to limit 
the knowledge of the codes to those with a need to know. Instrumentation which attempts to . 
automatically detect copyrighted material is a non-human example of "something" with a need to 
know the codes. 

niere are many ways to implement the idea of universal codes, each with their 
own merits regarding any given application. For the purposes of teaching this art. we separate 
these approaches into three broad categories: miiversal codes based on libraries, universal codes 
based on deterministic formula, and universal codes based on pre-defined industry standard 
patterns. A rough rule of thmnb is that the first is more secure than the latter two. but that the 
latter two are possibly more economical to implement than the first. 
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TTpi v«^1 Codes: 1^ ^Jhra-ies of TTnivcTsal Codes 

H,e use of libraries of universal codes simply means that the techniques of this 
invention are employed as described, except for the fact that only a limited set of the individual 
embedded code signals are generated and that any given encoded material wUl make use of some 
sub-set of this limited Wersal set" An example is in order here. A photographic print paper 
manufacturer may wish to pr^expose every piece of 8 by 10 inch print paper which they seU with 
a unique identification code. They also wish to sell identification code recognition software to 
their large customers, service bureaus, stock agencies, and individual photographers, so that aH 
these people can not only verify that their own material is correctly marked, tat so 

also detetmuie If third party material which they are about to acquire has been identified by this 
technology as bebg copyrighted. This latter information will help them verify copyright holders 
and avoid Utigation. among many other benefits, to order to "economically" instimte this plan, 
they realize that generating unique individual embedded codes for each and every piece of print 
pq^er would generate Terabytes of independent infoimation, which would need storing and to 
which recognition software would need access. Instead, they decide to embed thefr print paper 
with 16 bit identification codes derived from a set of only 50 independent "universal" embedded 
code signals. The details of how this is done are m the nart paragraph, but the point is that now 
their recognition software only needs to oontahi a Umited set of embedded codes in their library of 
codes, typically on the order of 1 Megabyte to 10 Megabytes of mfoimation for 50x16 individual 
embedded codes splayed out onto an 8x10 photographic print (allowing for digital compression). 
The reason for picking 50 tostead of just 16 is one of a little more added security, where if it were 
the same 16 embedded codes for all photographic sheets, not only would the serial number 
capability be limited to 2 to the 16th power, but lesser and lesser sophisticated pirates could crack 
the codes and ranove them using software tools. 

There are many different ways to implement this scheme, where the foUowmg is 
but one exemplary method. It is detennined by the wisdom of company management that a 300 
pixels per mch criteria for the embedded code signals is suflBcient resolution for most appUcations. 
This means that a composite embedded code image will contain 3000 pixels by 2400 pixels to be 
exposed at a very low level onto each 8x10 sheet This gives 72 mUlion pixels. Using our 
staggered coding system such as described in the black box implementation of Figs. 5 and 6, each 
individual embedded code signal will contain only 12 million divided by 16. or approximately 
450K true mformation carrying pixels, i.e. every 16th pixel along a given raster line. These 
values will typically be in the range of 2 to -2 in digital nmnbers. or adequately described by a 
signed 3 bit number. TTie raw infonnation content of an embedded code is then approxhnately 
3/8th's bytes times 450K or about 170 Kilobytes. Digital compression can reduce this further. 
All of these decisions are subject to standard engineering optimization principles as defined by any 
given application at hand, as is well known in the art. Thus we fmd that 50 of these independent 
embedded codes will amount to a few Megabytes. This is quite reasonable level to distribute as a 



wo 95/14289 



PCT/US94/13366 



•32- 



10 



15 



20 



25 



30 



35 



"libraiy" of universal codes within the recognition software. Advanced standard enayption 
devices could be employed to mask the exact nature of these codes if one were concerned that 
would-be pirates would buy the recognition software merely to reverse engineer the univeisal 
embedded codes. TTie recognition software could simply unenaypt the codes prior to applying the 
re«»gnition tedmiques taught m this disclosure. 

Tbc recognition software itself would certainly have a variety of features, but the 
care task it would perfoim is dctenaining if there is some universal copyright ca^^ 
image. Tl»k<y questions become WfflCM 16 of the total 50 univeisal codes it might conte^ 
any, and if there are 16 fomid, what are tfaeb bit vahies. Hie key variables in deteimining the' 
answers to these quesHons are: registration, rotation, magnificatian (scale), and extent In the most 
general case with no helpfial hints whatsoever. aU variables must be independently varied across all 
mutual combinations, and each of the 50 universal codes must then be checked by adding and 
subtracting to see if an entropy decrease occurs. Strictly speaking, this is an enonnons job. but 
many helpful hints will be found which make the job much simpler, such as having an original 
image to compare to the suspected copy, or knowing the general orientation and extent of the 
image relative to an 8x10 print paper. wUch then through simple registration techniques can 
detemiine all of the variables to some acceptable degree. Hen it merely requires cycling through 
the 50 universal codes to find any decrease in entropy. If one does, then 15 other, should as well 

A protocol needs to be set up whereby a given order of the 50 translates into a sequence of most 
significant bit through least significant bit of the ID code word. Urns if we find that umversal 
code number "4" is present, and we find its bit value to be "O", and that miiversal codes "1" 
through "3" are definitely not present, then our most significant bit of our N-bit ID code nmnber 
is a "0". Likewise, we find that the next lowest universal code present is nmnber "7" and it turns 
OM to be a -1". then our next most significant bit is a T. Done properly, this system can cleanly 
trace back to the copyright owner so long as they registered their photographic paper stock serial 
nmnber with some registry or with the manufi«turer of the paper itself TTiat is, we look up in the 
registry that a paper using universal embedded codes 4.7,1 1.12,15.19,21.26.27.2834.35.37.38.40 
and 48. and having the embedded code 0110 0101 0111 0100 belongs to Leonardo de Boticem an 
unknown wildlife photographer and glacier dnematogtapher whose address is in Northern Canada. 
We know this because he dntifiilly registered his film and paper stock, a few mimrtes of work 
when he bought the stock, which he plopped into the "no postage necessary- envelope that the 
manufacturing company kindly provided to make the process ridiculously simple. Somebody owes 
Uonardo a royalty check it would appear, and certainly the r^stty has automated this royalty 
payment process as part of its services. 

One final point is that truly sophisticated pirates and others with illicit intentions 
can mdeed employ a variety of ayptogtaphic and not so ciyptographic methods to crack these 
miiversal codes, sell them, and make software and hardware tools which can assist in the removing 
or distorting of codes. We shall not teach these methods as part of this disclosm^ however. In 
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any event, this is one of Hie prices which must be paid for the ease of universal codes and the 
q>plications th^ open up. 

Tfa^v^i < pnHes- 7^ Univf^al Codes Based on Detenninistic Fo rmulas 

llje libraries of universal codes require the storage and transmittal of Megabytes 
of independent, generally random data as the keys with which to unlock the existence and identity 
of signals and imagery that have been maiked with universal codes. Alternatively, various 
deterministic fdimulas can be used which -generate- what appear to be random datafimage frames, 
thereby obviating the need to store all of fliese codes in memory and interrogate each and of the 
"50" universal codes. Deterministic formulas can also assist in speeding up the process of 
determining the ED code once one is known to exist in a given signal or image. On the other 
hand, detenninistic formulas lend themselves to sleulliing by less sophisticated pirates. And once 
sleuthed, they l«id themselves to easier communication, such as posting on the Internet to a 
hundred newsgroups. There may well be many applications which do not care about sleuthing and 
publishing, and deterministic formuias for generating the individual universal embedded codes 
15 might be just the tidcet. 

TTniversal Codes: 3) "Simple" Un iversal Codes 

TOs category is abtt of a hybrid of the first two, and is most directed at truly 
large scale implementations of the principles of this technology. The appUcations employing this 
class are of the type where stamich security is much less important than low cost, large scale 
20 implementation and the vastly larger economic benefits that this enables. One exemplary 

application is placement of identification recognition units directly within modestly priced home 
audio and video instrumentation (such as a TV). Such recognition units would typically monitor 
audio and/or video looking for these copyright identification codes, and thence triggering simple 
decisions based on the findings, such as disabling or enabling recording capabUities, or 
25 incrementing program specific bUling meters which are transmitted back to a central audio/video 
service provider and placed onto monthly invoices. Likewise, it can be foreseen that "black 
boxes" in bars and other pubHc places can monitor (listen with a microphone) for copyrighted 
materials and generate detailed reports, for use by ASCAP. BMI, and the like. 

A core principle of simple universal codes is that some basic industry standard 
30 -noiselike- and seamlessly repetitive patterns are injected into signals, images, and image 

sequences so that inexpensive recognition units can either A) determine the mere existence of a 
copyright "flag", and B) additionally to A, determine precise identification information which can 
fecUitate more complex decision making and actions. 

In order to implement this particular embodiment of the present invention, the 
basic principles of generating the individual embedded noise signals need to be sfanplified in order 
to accommodate inexpensive recognition signal processing drcuitiy. while mamtaining the 
properties of effective randomness and holographic permeation. With large scale industry adoption 
of these simple codes, the codes themselves would border on public domam information (much as 
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cable scnunblfag boxes are almost de fecto pubUc domain), leaving the door open for detennined 
pirates to develop black market countermeasnres, but this situation wonld be quite analogous to the 
scrambling of cable video and the objective economic analysis of such illegal activity. 

One prior art know, to the applicant in this general area of pro-active copyright 
detection is the Serial Copy Management System adopted by many fmns in the audio industry. 
To the best of applicant's knowledge, this system employs a non-audio "flag- signal which is not 
part of the audio data stream, bw which is neverthdess grafted onto the audio stream a^^ 
iiKlicate whether the associated audio data should or should not be dupB^ One problem with 
this system is that it is restricted to media and instnmientation which can support 
signal. Another deficiency is that the flagging system cames no identity information which would 
be useful in making more complex decisions. Yet another difBculty is that high quality audio 
sampling of an analog signal can come arbitrarily close to making a perfect digital copy of some 
digital master and there seems to be no provision for inhibiting this possibility. 

nie principles of this invention can be brought to bear on these and other 
problems, in audio applications, video, and all of the other applications previously discussed. An 
exemplary appUcation of simple universal codes is the following. A single industry standard 
"1.000000 second of noise" would be defined as the most basic indicator of the presence or 
absence of the copyright markmg of any given audio signal. Fig. 9 has an example of what the 
wavefi)nn of an industry standard noise second might look like, both in the tune d«^ 
the frequency domain 402. It is by definition a continuous function and would adapt to any 
combmation of sampling nrtes and bit quanitizations. It has a normalized amplitude and can be 
scaled arbitrarily to any digital signal amplitude. TTie signal level and (he first M'th derivatives of 
the signal a«; continuous at the two boundaries 404 (Fig. 9C). such that when it is repeated, the 
"break- in the signal would not be visible (as a waveform) or audible when played through a high 
end audio system, lie choice of 1 second is ari,itrary in this example, where the precise length of 
the interval will be derived from considerations such as audibility, quasi-white noise status, 
seamless repeatability, simplicity of recognition processing, and speed with which a copyright 
markmg determim«ion can be made. The injection of this repeated noise signal onto a signal or 
hnage (agam. at levels below hmnan perception) would mdicate the presence of copyright 
material. This is essentially a one bit identification code, and the embedding of fimher 
identification information will be discussed later on in this section. The use of this identification 
technique can extend far beyond the low cost home implementations discussed here, where studios 
could use the technique, and monitoring stations could be set up which litemlly monitor hundreds 
of chamiels of mfoimation sunultaneously, searching for marked data streams, and furthermore 
searching for the associated identity codes which could be tied in with billing networks and 
royalty tracking systems. 

Hiis basic, standardized noise signahn-e is seamlessly repeated over and over 
agam and added to audio signals which are to be mariced with the base copyright identification. 
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Part of the reason for the ^oid "Simple" is seen here: cleariy pirates vdll know about this industry 
standard signal, but their ifficit uses derived from this knowledge, such as erasure or corruption, 
will be economically mtauscule relative to the economic value of the overall technique to the mass 
matket For most high end audio this signal wiU be some 80 to 100 dB down from fuU scale, or 
even much further; each situation can choose its own levels though certainly there will be 
recommendations. The amplitude of the signal can be modulated according to the audio signal 
levels to which the noise signature is being applied, i.e. the amplitude can increase significantly 
when a drmi beats, but not so dramaticaUy as to become audible or objectionable. These 

measures merely assist the recognition circuitry to be described. 
) Recognition of the presence of this noise signature by low cost instrumentation 

can be efected to a variety of ways. One rests on basic modifications to the simple principles of 
amiio signal power metering. Software recognition programs can also be written, and more 
sophisticated mathematical detection algorithms can be applied to audio in order to make higher 
confidence detection identifications. In such embodiments, detection of the copyright noise 
5 signamre involves comparing the time averaged power level of an audio signal with the time 

averaged power level of that same audio signal which has had the noise signature subtracted from 
it If the audio signal with the noise signature subtracted has a lower power level tiiat the 
unchanged audio signal, then the copyright agnamre is present and some status flag to to 
needstobeset. The main engmeering subtleties mvolved in makmg this comparison mchide: 
,0 dealing with audio speed playback discrepancies (e.g. an instrument might be 0.5% "slow" relative 
to exactly one second intervals): and. dealmg with the unknown phase of the one second noise 
signature within any given audio (basically, its "phase" can be anywhere from 0 to 1 seconds). 
Another subtlety, not so central as the above two but which nonetheless should be addressed, is 
that the recognition circuits should not subtract a higher amplimde of the noise signature than was 
25 originally embedded onto the audio signal. Fortunately this can be accomplished by merely 

subtracting only a small amplitude of the noise signal, and if the power level goes down, this is an 
indication of "headmg toward a trough" in the power levels. Yet another related subtlety is m 
the power level changes will be very smaD relative to the overall power levels, and calculations 
generally will need to be done with appropriate bit precision. e.g. 32 bit value operations and 
30 accumulations on 16-20 bit audio in the calculations of time averaged power levels. 

Clearly, designing and packaging this power level comparison processing 
circuitry for low cost applications is an engineering optimization task. One trade-off will be the 
accuracy of making an identification relative to the "short-cuts" which can be made to the circmtry 
in onier to lower its cost and complexity. A preferred embodiment for the placement of this 
35 recognition circuitry inside of instrumentation is tiuough a single programmable integrated circuit 
which is custom made for the task. Fig. 10 shows one such integrated circuit 506. Here the audio 
signal comes in, 500. either as a digital signal or as an analog signal to be digitized inside tiie IC 
500. and the output is a flag 502 wWch is set to one level if the copyright noise signature is 



wo 95/14289 



PCTAJS94/13366 



-36- 



found, and to another level if it is not found. Also depicted is the fact that the standa„«zed noise 
signature wavefonn is stored in Read Only Memoir. 504. inside the IC 506. n.ere wfll be a 
shght tune delay between the application of an audio signal to the IC 506 and the outpm of a 
valid flag 502. due to the need to monitor sotne finite portion of the audio befo« a ^cognition 
> can place, m this case, there need to be a "flag valid" output 508 whe« the IC Monns the 
extanal world if it has had enough time to make a proper detennination of the p«sence or 
abseoce of the copyright noise signature. 

'^"^"^^^''^^^yof^ficdesignsandphilosopWesofdesignsapplied 
to accomphshn^ the basic fl,n«i«, of the IC 506 of Fig. ,0. Audio engheers and digital signal 
processmg engineers a. able to generate several fnndamentaUy difB«nt designs. One such design 
.sd^rctedinFig. 1, by a process 599. which itself is subject to fi^er engineering opting 
as wni e discussed. Fig. 1 , depicts a flow ^ for any of: an analog sign^ pr^g „etworlc. 
a digital signal processing network, or pn,8ranuning steps in a soitwa« progn«n. We find an 
".put signal 600 which along one path is applied to a time aven.ged power meter 602. and the 
rmlting power output itself treated as a signal P,, To the upper right we find the standard noise 

504 which Will be read out at 125-/. of normal speed, 604. thus changing its pitch 
gnong thOhch dumged noise sigmU" 606. Iben the input signal has this pitch changed noise 
signal subtracted in step 608. and this new signal is applied to the same form of time averaged 

^ l^Ued as P_ 610. Step 612 then subtracts the power signal 602 fiom the power signr 
610. grvmg an output difference signal 613. If the universal standard noise signature does 
mdeed exist on the input audio signal 600. then «se 2. 6,6, wiB be created wherein a beat «gmd 
618 of approximately 4 second period will show up on the output signal 6,3. and it remains to 
d^ect thrs beat signal with a step such as in Fig. ,2. 622. Case 1. 614. is a steady noisy signal 
whrch exhibits no periodic beating. ,25% at step 604 is chosen arbitrarily here where 
^ginecring considerations would determine an optimal value, leading to different beat signal 
fi^iuencies 6,8. Whereas waiting 4 seconds in this example would be quite a while, especially is 
you would want to detect at least two or three beats. Fig. ,2 outiines how the basic design of Fig 
11 could be repeated and operated upon various delayed versions of the input signal, delayed by " 
something like l/20th of a second, with 20 parallel circuits woridng in concert each on a segment 
of the audio delayed by 0.05 seconds from their neighbors. In this way. a beat signal will show 
up approxm^ately every l/5th of a second and wDl look like a travelling wave down the colmmrs 
of beat detection circuits. Tire existence or absence of this traveUing beat wave triggers the 
detection flag 502. Meanwhile, there would be an audio signal monitor 624 which would ensure 
^ for example, at least two seconds of audio has been heard before setting the flag valid signal 
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Though the audio example was described above, it should be clear to anyone in 
the art that the same type of definition of some repetitive universal noise signal or image could be 
appUed to the many other signals, images, pictures, and physical media already discussed. 

The above case deals only with a single bit plane of infonnation, i.e., the noise 
signature signal is either there (1) or it isn't (0). For many applications, it would be nice to detect 
serial number information as well, which could then be used for more complex decisions, or for 
logging information on billing statements or whatnot Hie same principles as the above would 
apply, but now there would be N indepmdent nobe sigmitures as depicted in Fig. 9 instead one 
single such signature. TVpically, one such signature would be the master upon whidi Ae mere 
existence of a copyright marking is detected, and this would have generally higher power than the 
others, and then the other lower power -identification" noise signatures would be embedded into 
audio. Recognition circuits, once having found the existence of the primary noise signature, would 
then step through the other N noise signatures applying the same steps as described above. Where 
a beat signal is detecte4 this indicates the bit value of '1'. and where no beat signal is detected, 
this indicates a bit value of '0'. It might be typical that N will equal 32, that way 2« number of 
identification codes are available to any givoi industry employing this invention. 
T T .^ »f this Technology When the Len pth of the Idenrification Code is 1 

The prindples of this inviention can obviously be ^lied in the case where only 
a single presence or absence of an identification sigmd - a fingeiprint if you wiU - is used to 
20 provide confidence that some signal or nnage is copyrighted. The example above of the industry 
standard noise signature is one case in point. We no longer have the added confidence of the coin 
flip analogy, we no longer have tracking code capabilities or basic serial number capabilities, but 
many applications may not require these attributes and the added simplicity of a single fingerprint 
might outweigh these other attributes in any event. 
25 The "Wallpaper" Analogy 

The tcnn "hologr^hic" has beoi used in this disclosure to describe how an 
identification code number is distributed in a largely integral form throughout an encoded signal or 
image. Tbis also refers to the idea that any given fragment of the signal or image contains the 
entire miique identification code number. As with physical implementations of holography, there 
are limitations on how small a fragment can become before one begins to lose this property, where 
the resohition limits of the holographic media are the main factor in this regard for holography 
itself. In the case of an uncomipted distribution signal which has used the encoding device of 
figure 5, and which fiirthermore has used our "designed noise" of above wherein the zero's were 
randomly changed to a 1 or -1, then the extent of the fragment required is merely N contiguous 
samples in a signal or image raster line, where N is as defmed previously being the length of om: 
identification code number. This is an informational extreme; practical situations where noise and 
corruption are operative will require generally one. two or higher orders of magnitude more 
samples than this simple number N. Ihose skilled in the art will recognize that there are many 
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variables involved in pinning down precise statistics on the size of the smaUest fagment with 
which an identification can be made. 

For tutorial pmposes, the applicant also uses the analogy that the unique 
identification code number is Valipapered" acn,ss and image (or signal). TTiat is. it is repeated 
over and over again all throughout an image, m repetition of the ED code nmnber can be 
regular, as in the use of the encoder of figure 5. or random itself where the bits in the ID code 
216 of figure 6 are not stepped through in a noimal repetitive fashion but rather are randomly 
selected on each sample, and the random selection stored along with the value of the omput 228 
Itself in any event, the infonnation carrier of the ID code, the mdividual embedded code signal 
does change across the image or signal. THus as the wallpaper analogy summarizes: the ID code 
repeats itself over and over, but the pattenu. that each ^petition imprints change nmdomly 
accordingly to a generally unsleuthable key. 
Lossy Data Comnressinn 

As earlier mentioned, the identification coding of the prefened embodiment 
withstands lossy data compression, and subsequent decompression. Such compression is finding 

mcreasing use. particularly in contexts such as the mass distribution of digitized entm^ 
programmmg (movies, etc). 

While data encoded according to the piefened embodiment of the present 
mvention can withstand all tjpes of lossy comp«sion known to applicam. those expected to be 
most commercially important are the CCITT G3. CCTTT G4. JPEG. MPEG and JBIG 
compression/decompression standards. The CCITT standartis ar« widely used in black-and-white 
document compression (e.g. facsimUe and document-storage). JPEG is most widely used with still 
miages. MPEG is most widely used with moving images. JBIG is a likely successor to the 
CCnr standards for use with black-and-white imagery. Such techniques are well known to those 
m the lossy data compression field; a good overview can be found in Pemiebaker et al. JPEG. Still 
Image Data Compression Standard, Van Nostrand Reinhold, N.Y., 1993. 
Towards Stepanopnphy Proper and the The of thi. T..hn...^ , Jf^,,i„, ^n,. r^^p.,. 
Messages or Information 

Ihis disclosure concentrates on what above was called wallpapering a single 
identification code across an entire sigmd. This appears to be a desirable feature for many 
applications. However, there are other applications where it might be desirable to pass messages 
or to embed very long strings of pertinent identification information in signals and images. One 
of many such possible applications would be where a given signal or image is meant to be 
manipulated by several dififerem groups, and that certain regions of an image ar« ,«erved for each 
group's identification and insertion of pertinent manipulation information. 

In these cases, the code word 216 m figure 6 can actually change in some 
pre-defined mamrer as a function of signal or unage position. For example, in an image, the code 
could change for each and every raster line of the digital image. It might be a 16 bit code word 
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216 but each scan line would have a new code word, and thus a 480 scan line image could pass a 
980 (480 X 2 bytes) byte message. A receiver of the message would need to have access to ether 
the noise signal stored in memory 214. or would have to kuow the universal code structure of the 
noise codes if that mediod of coding was being used. To the best of applicant's knowledge, this .s 

5 a novel ^roach to the mature field of steganography. 

In all three of the foregomg applications of universal codes, it wiU often be 
desirable to append a short (perhaps 8- or 16-bit) private code, which users would keep in their 
own secured places, in addition to the univenal code. This affords the user a farther modicum of 
security against potential erasure of the universal codes by sophisticated pirates. 

10 Conclusion 

In view of the great mmiber of different embodiments to which the principles of 
my invention can be put. it should be recognized that the detailed embodimentt are illustrative 
only and should not be taken as Imiitmg the scope of my invention. Rather, 1 claim as my 
invention all such embodiments as may come within the scope and spirit of the following claims. 
15 and equivalents thereto. 
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APPENDIX A 

#include "main.h" 

#define XDIM 512L 
#define XDIMR 512 
#define YDIM 4B0L 
#def ine BITS 8 
#define RMS_VAL 5.0 
#define NDM_NOIsy 16 
#define NUM_DEMOS 3 
#define GRAD^THRESHOLD 10 

struct char__buf { 

char filename [80] ; 
FILE *fp; 
fpos_t fpos; 
char buf [XDIMR] ; 

struct uchar_buf { 

char filename [80] ; 
FILE ♦fp; 
fpos_t fpos; 

unsigned char buf [XDIMR] ; 

struct int_buf { 

char filename [80] ; 
FILE *fp; 
fpOs_t fpoS; 
int buf [XDIMR] ; 

struct cortex_6 { 

char filename [80] ; 
FILE ♦fp; 
fpos_t fpos; 

unsigned char buf [XDIMR] ; 



struct uchar_buf test_image; 
struct char_buf snow^composite ; 
struct uchar_buf distributed image; 
struct uchar_buf temp_image;'" 
struct int_buf temp_wordbuf fer; 
struct int_buf temp_wordbuf f er2 • 
struct uchar_buf snow_images; 
struct cortex_s cortex; 

int demo^o; /* which demo is being performed, see notes ♦/ 

int our code; /* id value embedded onto image */ 
int found^code=0; /* holder for found code*/ 



int waitvbb (void) { 

while ( (_inp (PORT_BASB) &8) ) ; 
while ( J (_inp(PORT BASE)&8) ); 
return (1); " ' 



int grabb (void) { 
waitvbb 0 ; 

_outp (P0RT_BASE+1, 0) ; 
_outp (PORT_BASE , 8 ) ; 
waitvbb 0 ; 
waitvbb 0; 

_outp(PORT_BASE, 0x10) ; 
return (1) ; 
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} 

int livee(void) { 

_OUtp(PORT_BASE,OxOO) ; 

return (1) ; 

} 

int live_video(void) { 
liveeO ; 
return (1) ; 

} 

int f reeze_f rame (void) { 
grabbO ; 
return (1) ; 

} 

int grab^frame (struct uchar^buf * image ) { 

long i; 

grabbO; . _ . 

fsetpos(iinage->fp, &xmage->fpos ); 
f setpos (cortex. fp, &cortex. fpos ) ; 

'°"^^;e;d(™x^^^^ ;^^"?Sae*3i{ • 

fwrite (cortex.buf , sizeof (unsigned char) ,XDIMR, iinage->fp) , 

liveeO ; 
return (1) ; 

} 

int wait_yerticaljDlanks (int number) { 

for U=0;i<number;i++)waitvbb() ; 
return (1) ; 

} 

int clear_char_iTnage (struct char__buf *charbuf f er) { 

long i,j; 
char *pchar; 
fpos_t tmp^fpos; 

f setpos (charbuffer->fp, &charbuf f er- >f pes ); 
for (i=0 ; i<YDIM; i++) { ^ . 

fgetpos(charbuffer->fp, &tmp_fpos ); 

lr^d(chaJ^3fe?>b3?8U .XDIMR.charbuffer->fp) ; 

for(j=0;j<XDIM; j++) * (pchar++) « 0; 

fsetpos (charbuffer->fp, &titqc)_fpos ); ^ ^ - . 

fwrite1cLrbuffer->buf ,sizeof ^ ,XDIMR, charbuf fer->fp) ; 

return (1); 

1 

int display^uchar (struct uchar_buf *image,int stretch) { 

unsigned char *pimage; 

xmsigned char highest = 0; 

unsigned char lowest = 255; 

long i/j; , *. 

double dtemp, scale, dlowest; 

fpos_t tmp_fpos; 

if (stretch) { . * \ 

f setDOS (iinage->fp, &xmage->rpos . 

f read Umage:>buf: sizeof (unsigned char) , XDIWR, image- >fp ; 

fread(iSe->buf,sizeof (unsigned char) ,XDIMR, image- >fp) ; 
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for(i=2;i<(YDIM-2) ;i++) { 

for(j=3;j<(XDIM-3) ; j++) { 

if j *piinage > highest ) highest - *piinage; 

} ^ 

if (highest == lowest ){ 
contrast-); P'^^'^" < "^^'"^thing wrong in contrast stretch, zero 
^ exit(l); 

fgetpos(iinage->fp, &tinp fpos )• 

,XDXMR,i™age->fp, , 

for(j=0;j<XDIM;j++){ 

dteijp . ( (double) *pimage - dlowest) *scale- 

if(dtenip < 0.0)*(piniage++) = O; 

till "<'?'=«™P > 255.0) Mpiinage++) = 255; 
^ else Mpiinage++) = (unsigned char) dtemp,' 

fsetpos( image- >fp, &tmp fpos ); 
char),XDiMR,f;^^^!i|J?f->^"^'«i"<='?(«n«i9ned 

) ^ 

fsetpos (iinage->fp, &iinage->fpos ) • 
fsetpos (cortex. fp, ficortex.fpos )' 
for(i=0;i<YDIM;i++){ ^ 

return (1); 



^""^ ^"""-^"^ *wordbuffer){ 



int *pword; 
fpos_t tinp_fpos; 



); 



} 



forS^o/r^SJ^jff::)!'^' — <^«er->fpos 

&f 'SSif . ^^-^p- > ' 

fsetpos (wordbuffer->fp, stmp foos ')• 
^ fwrate(wordbuffer->buf ,sizeof (Lt) ,kMR,wordbuffer->fp, ; 
return (1); 



'*°"^ionfijr^"-'°*''^*'"'°*' *wordbuffer){ 
int *pword; 
double meaneO.O; 

fsetpos (wordbuf fer«>fp, fiwordbuf f er- >f pos ) ; 
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for(i=0;i<YDIM;i++) { 

"Dword = wordbuf f er->buf ; . 
fread(wordbuffer.>buf, sizeof (int) ,XDIMR,wordbuf f er.>fp) ; 
for(j-0;j<XDlM;j++) tnean += (double) *(pword++); 

lean /= ( (double) XDIM * (double) YDIM) ; 
retiim (mean) ; 

} 

int add_uchar_to^int (struct uchar^buf *image, struct int^buf *word) { 
unsigned char *pimage; 
int *pword; 
long i,j; 
fpos_t ttrqp_fpos; 

f setpos ( image- >fp, &image->fpos ); 
f setpos (word- >fp, &word->fpos ); 
for(i=0;i<YDIM;i++) { 
pword = word->buf ; 

fgetpos(word->fp, &tmp f pos ); ^ * x 

fread (word- >buf, sizeof Tint) ,XDIMR,word->fp) : 

fSumaS^>bu?^^ char) mil^,image->fp) ; 

for(j=0;j<XDIM;j++) * (pword++) +- (int) * (pxmage++) ; 
f setpos (word- >fp, 5ctrap_fpos ); , xr x 

fwrite (word->buf, sizeof (int) ,XDIMR,word->fp) ; 

return (1) ; 



int add^char^to^uchar^creating^uchar (struct char^buf *ciinage, 
struct uchar_buf * image, 
struct uchar_buf *out_image) { 
unsigned char ♦pimage, *pout_image; 
char *pc image ; 
int temp; 
long i,j; 

f setpos (image->fp, &image->fpos ); 

f setpos (out_image->fp, &out_image->fpos }; 

fsetpos(cimage->fp, &cimage->fpos ); 

for(i=0;i<yDIM;i++) { 

Dcimage = cimage->buf; ^ 
fread(cimage->buf, sizeof (char) ,XDIMR,cimage->fp) ; 

f S U^ge- >bu?si^ (unsigned char) ,XDIMR, image->fp) ; 
pout_image = out_image->buf ; 

for (i=0; j<XDIM; j++) { . , . \ 

tem^ = (int) * (pimage++) + (int) * (pcimage++) ; 
if (temp<0) temp = 0; 
else if (temp > 255)tentp = 255; 
* (pout_image++) = (unsigned char) temp; 

fwrite (out__image->buf , sizeof (unsigned 
char) , XDIMR, out_image- >f p) ; 

return (1) ; 

} 

int copy_int_to_int (struct int_buf *word2 , struct int_buf *word) { 

long i; 

f setpos (word2->fp, &word2->fpos ); 
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} 



fsetpos(word->fp, &word->fpos ); 
for(i=0;i<YDIM;i++) { 

fread(word->buf ,si2eof (int) ,XDIMR, word->fp) ; 
^ fwrite(word->buf ,sizeof (int) ,XDIMR,word2->fp) ; 

return (1) ; 



void get_snow_images (void) { 

unsigned char *psnow, *pteinp; 
int nuinber__snow_inputs ; 
int temp, *pword,*pword2, bit; 
long i, j; 

double rms,dteinp; 

live_video(); /* device specific */ 

Sn«{n\^^^^!^^"^r^?^ ^ "^^^i^ lit Wank wall ") - 

printf ("\nDefocus the lens a bit as well ") - ' 

and possible, place the camera into its highest gain, 

printf ("\nput the gamma to 1.0. »); 
printf Ensure that the video is not saturated ") • 
printf {"\nPress any key when ready... „,^''''"^®^ 

while ( IkbhitO ); 

printf ("\nNow finding difference frame rms value... «)- 

lIve^SeooT^ ^"^^^ ^^^"^ ^^^"^^^^ ^^^^ ™« difference */ 

wait_vertical_blanks (2) ; 

grab_frame(&teinp image); 

live_video(); 

wait_vertical blanks (2); 

grab_frame{&dlstributed_image); /* use first image as buffer */ 
rms B 0.0; 

fsetpos (temp image. fp, &temp image. fpos ); 

lor^SJL'SSS::?!'""^-^^' ^ciistr?.utehmage.fpos ,; 

ptemp o terap_image.buf ; 

nh^r^^ „^"ad(terap_image.buf , sizeof (unsigned 
char) , XDIMR, temp_image . f p) ; 

psnow » distributed_image.buf ; 

char^ vnT^^^l^'^^^^^^^'^^^^-^^^^^-^^f'Sizeof (unsigned 
Char) , XDIMR, distributed_image . f p) • 

for(j=0;j<XDIM;j++} { 

temp = (int) * (psnow++) - (int) * (DtemD++l . 
dtemp = (double) temp; tptemp++} , 

dtemp *« dtenp; 
rms +« dtemp; 

} 

rms /= ( (double) XDIM ♦ (double) VDIM ) • 
rms = sqrt (rms) ; ' 

found!'-?^i)")"^^ ''^""^ difference noise value of %lf v«s 

measure". RMS VAL) • 

/* we want rms to be at least RMS VAL DN, so •**^/al) , 
If (rms > RMS_VAL) number_snow_inpiIts - i 
exse I 

dtemp ^ RMS__VMi / rms; 
dtemp *a dtemp ; 
^ number_snow_inputs » i + (int) dtemp; 

printf ("\n%d images will achieve this noise 
level " , number_snow_inputs) ; 
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/* now create each snowy image */ 

printf ("\nStarting to create snow pictures... 
fsetpos(snow_itnages.fp, &snow^images . f pes ); /* set on first 

"■"^^f or (bit = 0; bit < BITS; bit++) { 

clear_int_image(&teinp_wordbuffer) ; 
for ( i=0 ; i<nuTnber_snow_inputs ; [ 

live_video{) ; 

wait_vertical_blanks(2) ; 

qrab f rame (&temp_image) ; «*^^\ 
add Schar_to_int (fctemp^image, &terap_wordbuf f er) ; 

} 

clear int_image ( &ten^_wordbuf f er2 ) ; 
for ( i = 0 ; i <number_snow_input s ; i++ ) { 

live_video() ; 

wait_vertical_blanks(2) ; 

grab frame (&temp_iinage} ; . 
add Schar_to_int(&temp_image,&temp„wordbuffer2} , 

) 

/* now load snow^images[bitl with the difference frame 
biased by^^ ^ unsigned char form just to keep things clean */ 
/* display it on cortex also */ ^ * ^« \ 

fsetpol(temp wordbuf fer2 .fp, &temp_wordbuf f er2 . fpos ) ; 
fsetpos(tempIwordbuffer.fp, &temp wrdbuf f er. fpos ), 
f setpos ( temp_image . f p , &temp_image . f pos ) ? 
for(i=0;i<YDIM;i++) { . 

ffer2.fp); . , . 

psnow - snow__images . but ; 

ptemp = teTnp_image.buf ; 

for (i=0; j<XDIM; j++) { . ^ ^ * 

*{psnow++) i *{ptemp++) = (unsigned char) 
(*(pword++) - *(pword2++) + 128); 

f write (snow_images .buf , sizeof (unsigned 

char) , XDIMR, snow images . f p) ; . . 

f write (temp^image .buf . sizeof (unsigned 

char) , XDIMR , temp_image . f p) ; 

£s|lIy!u^rUtemp_i«age,0); /*! signifies to stretch the 

contrast*/ . _ „ u4 4-\ . 

printf ("\rDone snowy %d ",bit); 
wait vertical_blanks(30) ; 

} 

return; 



void loop_visual (void) { 
unsigned char *psnow; 
char *pcoinp; 
long i,j, count » 0; 
int ok=0,ten)p,bit,add_it; 
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double scale = 1.0 / RMS VAL; 
aoiible dtemp, tmpscale; 
fpos_t tmp_^fpos; 

need'^to ^"^^ ^ composite should be approximately i dn, thus we 
do thf^* ^ approximately 5*BITS as a first guess, then 

visual loop to zoom in on final acceptable value ♦/ 

'"«<"\«\° NOW calculating initial guess at amplitude... 
while ( lok ) { 

/♦ calculate snow_composite */ 
/* clear composite ♦/ 

clear_char_image(&snow_con5)osite) ; 
first iJ|^^f^<«^*-i««ges.fp, &snow_in«ges . f pos ); /* set on 

for(bit=0;bit<BITS;bit++) / 
j » 128 » bit; 
if ( our_code & j)add it=l; 
else add_it=0; 

lort^oferS^rt^^-^P' *-ow_composite.fpos ,; 
psnow » snow_iinages.buf ; 
Char) ,XDIMR, snowS^girfJrt"^" ir^i^e^i 

f getpos (snow_composite . fp, &tmp_f pos ) ; 

fread(snow_composite^buf sizeof (char) ,XBlMR.snow composite.fp) ; 

pcomp = snow_composite.buf • ^ ^^"^c-tp/ , 
for(j-0;j<XDIM;j++) { 

f?(^e;^io'^OH'"^*'^^"°"**' 

^ temp « -(int) fabs( -dtemp +0.5); 

else { 

J temp » (int) fabs{ dtemp +0.5); 

if (temp > 127) { 
^ temp = 127; 

else if (temp < -128) { 
^ temp = -128; 

if {add_it){ 

^ *{pcomp++) (char) temp; 
else { 

*(pcomp++) (char) temp; 

} 

f8etpos(snow__composite.fp, &tmp_fpos ); 
f write (snow_composite .buf , sizeof (char) , XDIMR, snow_composite . fp) ; 
^ printf ("\rDone snowy %d ",bit); 

" ^« «/ 

&snow_composi t e , 
&test_image. 
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fidistributed^image) ; 

/* display both and cue for putting scale down, up or ok */ 

p^^Snn^Depress any key to toggle, enter to move on...\n 

printf("\r Distributed Image. . . "); 
display uchar ( &distributed_xmage , 0 ) ; 
whileC getchO 1= '\^' ){ 
if ( (count++) % 2) ( 

printf("\r Distributed Image.,. »•); 
display_uchar(&distributed_image,0) ; 

ilse 

^ ^ printf("\r Original Image... ") ; 

display^uchar ( &tes t^image , 0 ) ; 

Jrintf ("\nScale = %lf %scale); 

printf ("\nEnter new scale, or >le6 for ok... >, 

scanf ("%lf "/&tmpscale) ; 

if (tnqpscale > le6)ok=l; 

else scale = tmpscale; 

/* distributed image now is ok, calculate actual snow.images 

used and , 
store in those arrays; */ 

fsetpos(snow^images.fp, wnow^images , f pos ); /* set on first 

^"^^printf ("\nNow storing snow images as used... \n") ; 
for(bit=0;bit<BITS;bit++) { 
for(i=0;i<YDIM;i++) { 

psnow = snow_images.bur ; 
fgetpos(snow_images.fp,, &tmp_fpos ); 
fread(snow_images.buf,sizeof (unsigned 

char) , XDIMR , snow_images . f p) ; 

for(i=0; j<XDIM;3++) { . ^ -, 

dteic^ = ( (double) *psnow -128.0) * scale,- 
if (dtemp<0.0) { ^ ^^ 

temp « -(int) fabs( -dtemp +0.5); 

} , 
else \ 

temp « (int) fabs( dtemp +0.5); 
Mpsnow++) = (unsigned char) (temp + 128); 

f setpos (snow_images . f p, &tmp f pos ) ; 
f write (snow^images .buf , sizeof (unsxgned 
char) , XDIMR , snow_images . f p) ; 

printf ( " \rDone snowy %d " , bit ) ; 

} 

return; 

} 



double find grad (struct int_buf * image, int load^buffer2) { 
int bSuroiMR] ,buf2 [XDIMR] ,buf 3 [XDIMR] ; 



int *pbufl,*pbuf2,*pbuf3,*p2; 
double total=0.0,dten^; 
long i, j; 
fpos_t tmp_j)OS; 
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fsetposanage->fp, &iinage->fpos ); 
fgetpos (image- >fp, &tmp_pos ) ; 

fsetpos(temp_wordbuffer2.fp, &temp_wordbuf f er2 . fpos ); 

fOr(l>i;i<(VDIM-l) ;i++) { 

fBetpoB(image->fp, &tmp_pos ); 
£read{bufl,si2eof (int) ,XDIMR, image- >fp) • 
fgetpos (iniage->fp, stmp_pos ); ^' ' 

fread(buf2,sizeof (int) ,XDIMR, image- >fp) , • 

pbuf2=buf2i 
pbuf3abuf3; 

p2 =. temp_wordbuffer2.buf ; 

if {load buffer2){ 

forTj=l/j<(XDIM-l) ;j++) { 

dterap = (double) * (pbufl++) ; 

dtemp += (double) * {pbufi++} ; 

dtemp +s= (double) * {pbuf i--) • 

dtemp += (double) *(pbuf 2++) ' 

dtemp -= (8.0 * (double) *(pbuf2++)); 

dtemp += (double) * (pbuf2--) ; 

dterap += (double) * (pbuf 3++) ■ 

dtemp (double) * (pbuf 3++) ; 

dtemp (double) * (pbuf 3--) • 

*p2 = (int)dteiip; 

if ( *p2 > GRAD_THRESHOLD ) { 

^ *(p2++) -= GRAD_THRESHOLD; 

else if( *p2 < -GRAD^THRESHOLD ){ 
^ *(p2++) +a GRAD_THRBSHOLD; 

else { 

*(p2++) = 0; 

} 

fwrite(tenp.wordbuffer2.buf ,sizeof (int) ,XDIMR, temp.wordbuffer2,fp) ; 
else { 

dten^ s (double) * (pbuf l++) ; 
dten^ += (doxible) * (pbuf 1++) ; 
dtemp +s (double)* (pbuf 1--) ; 
dtemp += (double) * (pbuf 2++) ; 
dtemp (8.0 * (double) * (pbuf 2++) ) ; 
dtemp += (double) * (pbuf 2--) ; 
dtemp += (double) * (pbuf 3++) ' 
dtemp += (double)* (pbuf 3++) i 
dtemp += (double ) * (pbuf 3 - - ) ' 

dtemp -= (double) * (p2++) ; 

dtemp *a dtemp; 
^ total += dten^); 



^ return ( total ) ; 
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void search^l (struct uchar^buf *suspect) { 

imsigned char *psuspect, *psnow; 

int bit, *pword, temp; 

long i/j; ^ . 

double add_me tr ic , subtract_metric ; 

fpos_t tTtip_fpos; 

/* this algorithm is conceptually the simplest. The idea is to 
^^^^through each bit at a time and merely see if adding or 

^"^^I^SiiduSfsnowy picture minimizes some 'contrast' njetric 

iSis should be the most crude and inefficient, no where to go 

but 

better */ 

f setpos ( snow_images . f p , &snow_images . f pos ) ; 

temp=256; 

clear int image (&temp_wordbuffer ) ; 

add uchar'to int (suspect, &tenq?wordbuffer) ; ^a^^.^^c^o 
findl^gradT&timp.wordbuffer,!); /* 1 means load temp.wordbuf fer2 

^ for (bit=0 ;bit<BITS ;bit++) { 
/* add first */ 

f ge tpos ( snow__images . f p , &tmp_f pos } ; 
f setpos (suspect->fp, &suspect->fpos > ? . ^ . 

f setpos (teTnp_wordbuf f er . f p, fctemp^wordbuf f er . f pos ) ; 
for(i=0;i<YDIM;i++) { 

pword = teinp_wordbuffer.buf ; 

psuspect = suspect ->buf; 

psnow = snow_images.buf ; 

f read (suspect ->buf , sizeof (unsigned 

char) ,XDIMR, suspect- >fp) ; « • /„„«4««o<i 

f read (snow_images .buf , sizeof (unsigned 

char) , XDIMR, snow_images . f p) ; 

^MpwrSi7= Unt) * (psuspect++) + (int) * (psnow++> -128 ; 
f write ( temp^wordbuf f er .buf , sizeof (int ) , XDIMR, temp_wordbu 

ffer.fp); 

idd^metric = f iiid_grad(&temp_wordbuf fer, 0) ? 
/* then svibtract */ 

f setpos ( snow_images . f p , &tnip_f pos ) ; 
f setpos ( suspect- >fp, &suspect->fpos ); * « v 

f setpos ( terap^wordbuf f er . f p , «ctemp_wordbuf f er . f pos ) ; 
for(i=0;i<YDIM;i++) { 

pword = teinp_wordbuffer.buf ; 

psuspect = suspect- >buf; 

psnow = snow_images.buf ; 

f read (suspect- >buf, sizeof (unsigned 

char) , XDIMR, suspect->fp) ; , ^ . ^/ . 

f read ( snow^images . buf , sizeof (unsigned 

char) , XDIMR, snow_images . f p) ; 

for (i=0: j<XDIM; I ^ , x -.oo 

* (pword++) » (int) * (psuspect++) - (int) * (psnow++) +128 ; 

fwrite (temp^wordbuffer .buf , sizeof (int) , XDIMR, teinp_wordbu 

ffer.fp); 

lubtract__metric = f ind3rad(&temp__wordbuf f er, 0) ; 

printf ("\nbit place %d: add»%le , 
sub=%le" , bit , add_metric, subtract^metric) ; 

temp/=2; ^ j x r 

if (add__metric < subtractjmetric) I 
printf (" bit value - 0"); 
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else { 

printf(" bit value = 1") ; 
found_code += temp; 

} 

?etSm^"^''^''^°'''' '^^^'^ number was %d", found code) ; 



void search_2 (unsigned char *suspect) { 
if (suspect) ; 
return; 



} 



void loop_simulation(void) { 

unsigned char *ptemp, *pdist; 
int *pword,int_mean,ok«iO,temp; 
long i,j; 

double mean, scale; 

iZ^llf'inx °°isy image into one of the temp buffers */ 
printf ( "\ngrabbing noisy frame . , . \n») ; ^^^^^rs / 
clear_int_image(&temp wordbuf fer) ; 
for(iaO;i<NDMJlOISY;i^.+) ( 

live_video(); 

wait__yertical_blanks (2) ; 

grab_frame(&temp_image) ; 

jftrSNSM°Nj?gr'^-'"«^*' ^te-^-Wordbuf fer) ; 
^ print£("\r%ld of %ld 

/* find mean value of teinp_wordbuf f er */ 
mean = f ind_mean_int (fitemp wordbuf fer) • 
int_mean - {int)mean; ' 

dist^Il^^^d"^ ^"^^^ 'corruption' to our 

image */ 
scale =1.0; 
while ( !ok ) { 

f«t™«?H,-ff ^i^t image storing in temp image */ 

foo^S^f ^ ""^'''^^^-^"^^^•^P' ^distributed imaie.fpos )• 
fsetpos temp^wordbuffer.fp, &temp wordbuffir!flos )^ 
fsetpos{temp^image.fp, &temp imagi.fpos 
for(i=0;i<YDIM;i++){ ~ ^ ^ ^' 

pdist = distributed_iraage.buf ; 

pword = temp_wordbuffer.buf ; 

ptemp = temp_image . buf ; 

Char, ,XDI,«. JlI?^i£L'rLlfTf??r-''"''''"°'^"^'^^'^ 
fer.fp, ; ^"^<*'<=«"'P-«o'^db'^«er.buf .sizeof (int) .XDlMR,temp_wordbuf 
for(j=0; j<XDIM; j++) { 

else if (temp > 255) temp = 255; 
^ *(ptemp++) r: (iinsigned char) temp; 
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f write { temp^image .buf , sizeof (unsigned 
char) , XDIMR, temp^image . f p) ; 

/* display the dist image and the corrupted image */ 
display_uchar ( &teinp_image , 0 ) ; 

/* apply new 'corrupted' image to search algorithm l for id 

value */ J . 

search_l(&tetnp_image) ; 

/* apply new 'corrupted' -image to search algorithm 2 for id 

value */ 

/* 

search_2 (temp_image) ; 
*/ 

/* prompt for upping noise content or ok */ 

ok = 1; 

} 

return; 

} 



int initialize^everything (void) { 
long i,j; 

unsigned char *pucbxir; 
char ♦pcbuf; 
int *pibuf; 

/* initialize cortex */ 
strcDV ( cortex . filename , " f : image" ) ; 
5 Ucortex fp^ (cortex. filename, "rb" ) ) ^^NOLL) { 
system ("V f g") ; 

else f close (cortex, fp) ; 

if( ( inp(PORT_BASE) « OxFF) ){ 

printf("oops •») ; 

exit{0) ; 

} 

/* open cortex for read and write */ vrmTU 
if ( (cortex. fp=f open (cortex . filename, "rb+« ) ) =«NOLL) { 

printf (" No good on open file ^oe j; 

exit(O) ; 

fgetpos (cortex. fp, &cortex.fpos ); 
/* test image; original image */ . „„x 

s^tsri««=t!£^ak»». ...... ( 

printf (" No good on open file Doe ) , 
exit(O) ; 

pucbuf = test_image.buf ; 
for (i=0 ; i<XDIM; i++) * (pucbuf ++) =0; 
for(i=0;i<YDIM;i++) { . ^a^^a 

fwrite (test^image.buf, sizeof (unsigned 

char) , XDIMR , tes t_image . f p) ; 

printf (" NO good on open file joe ); 
exit(O) ; 
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} 

fgetpos{test_iinage,fp, &test_iinage.fpos ); 

^ exit (0) ; J I , 

pcbuf = snow_coinposite.buf ; 

for (i=0;i<XDlM;i++) * (pcbuf++)»0; 

for(i=0;i<YDIM;i++) { 

fwrite(snow_composite.buf ,sizeof (Char) ,XDIMR,snow^conrposite.fp) ; 
f close {Bnow_coniposite . fp) ; 

^ exit (0) ; J ^ I , 

fgetpos(snow_oompo8ite.fp, fisnow_eon5>oBite.fpos ); 

^* *^trSwd?«ri?S'f^ .test_img plus snow composite */ 
strcpy(dastributed_image. filename, "e:dit_inig") ; 

i^((di8tributed_image.fp-fopen(distributed_image.fileaame.-wb.))«NU 

printf (" No good on open file joe ") • 
^ exit(O); 

pucbuf = distributed image. buf; 
for x=0 ; i<XDIM; i++) *Tpucbuf ++) =0 ; 
for(x=0;i<VDIM;i++) { 

charKxn?J^f5!i?^i^%^!^^-iJ;t-^'^ 
f close (distributed_iroage . f p) ; 

in{distributed_image.fp=f<^en(distributed_image.filename,-rb*^ 

printf (" No good on open file ioe -) • 
^ exit(O); 

fgetpos(distributed_image.fp, &distributed_image.fpos ); 

/* ten5>_image; buffer if needed */ 

strcpy(ten5)_image. filename, -e.-temp img") , 

pucbuf s temp_iinage,buf ; 

for (i=0;i<XDIM;i++) * (pucbuf ++) «0; 

for(i=0;i<YDIM;i++) { 

nh«^^ vnT^"^!^ (temp^itnage.buf , sizeof (unsigned 
char) , XDIMR , temp_image . f p) ; ^ 

f close ( temp^image . f p) ; 

^ exit(O) ; J = / . 

fgetpos(teiii)_image.fp, &temp_image . f pos ); 
/* temp_wordbuffer; 16 bit image buffer for averaging */ 
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strcpy ( teit^)_wordbuf f er . filename , - e : teitip^wrd" ) ; 

if ( (temp wordbuf f er . f p«f open (temp wordbuf f er . filename , " wb« ) ) -NULL) { 
printf (« No good on open file 3oe ; 
exit(O) ; 

pibuf « temp wordbuf fer.buf ; 
for (i«0 ; i<XDIM; i++) * (pibuf ++) «0 ; 
for(i»0;i<TOIM;i++) { 
fwrite(temp.wordbuffer.buf ,si2eof (int) ,XDIMR, temp^wordbuf fer .fp) ; 

fclose(tenq?_jwordbuffer.fp) ; 
if ( (temp^wordbuffer.fp«fopen(temp.wordbuffer. filename, ««rb+") ) «NOLL) 

^ printf {" No good on open file joe 

exit(O); 

fgetpos(temp_wordbuffer.fp, &temp_wordbuf f er . f pos ); 

/* temp wordbuf fer2; /* 16 bit i««9%^^^^^^°^,.?T"^^^"^ 
strcpy(temp_wordbuffer2. filename, "e:tmp_wrd2") ; 

if ( (temp_wordbuffer2.fp=fopen(temp.wordbuffer2. filename, "wb") )==NULL 



){ 



printf (" No good on open file joe ") ; 
exit(O) ; 

pibuf = temp_wordbuffer2.buf ; 
for {i=0; i<XDIM; i++) * (pibuf ++) -0; 
for(i=0;i<YDIM;i++) { 

fwrite(temp_wordbuffer2.buf ,8izeof (int) ,XDIMR,temp_wordbuf fer2.fp) ; 

f close (temp_wordbuf f er2 .fp) ; 
if ( ( temp^wordbuf f er2 . f p=f open (temp.wordbuf f er2 . filename, -rb+" ) ) «NUL 

printf (" No good on open file joe ") ; 
exit(O) ; 

f getpos (temp.wordbuf f er2 . f p, &temp_wordbuf f er2 . f pos ) ; 

/* snow images; BITS number of constituent snowy pictures */ 
Q trcnv ( snow images . filename , " snw_amgs " } ; . 

printf C No good on open file 3oe ) ; 
exit(O) ; 

*pucbuf = snow_images.buf ; 
f or (i=0 ; i<XDIM;i++) * (pucbuf ++) =0 ; 
for(j«0;j<BITS;j++){ 

for(i=0;i<YDIM;i++) { . . ^ 

f write (snow_images .buf , sizeof (unsigned 
char) ,XDlMR,snow_images.fp) ; 

printf(" No good on open file 3oe ); 
exit(O); 

f getpos ( snow^images . f p , &snow_images . f pos ) ; 
return (1) ; . 
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int close_everything(void) { 

f close ( test_iinage . fp) ; 

f close (snow_composite. fp) • 

fclose(distributed_image,fp) / 

f close ( temp_image . fp) ; 

f close (temp^wordbuf f er . fp) ; 

f close (tenip^wordbuf f er2 . fp) ; 

f close (snow_iinages . fp) ; 



return (1) ; 



mainO { 

int i,j; 



printf ("\nlnitiali2ing. . .\ii\n") ■ 

in.txalize.everything{); /* device specific and global mallocs 
live_video() ; 

brSt? "r^n^ -f"? ^° yo^'want to run?\n\n«); 
SimulatiSS"; ^^'^^^ '"^^^'^ High ^End Photography 

printf ("\nEnter ^er a^d^rt^^^^^/t'^' m-Camera) \n.) ; 
scanf ("%d»,&deino); ^ctum. 
if (demo < 1 | | demo > mdm DEMOS) I 
printf CXn eh eh ») ;~ ^ 

} 

/* acquire test image */ 

^eS/;^'^""" ""^^ ^'^-'^ test scene is ready... «, ; 

specff £-ftu?|ir"'-"^^^' /*9^^-frame takes care of device 
/* proinpt for id number, 0 through 255 */ 

image : " ) ; 

is wx be the unique magic code placed 'into the 
scanf ( " %d" , &our_code) ; 
while (our_code<l || our_code>256) { 

prxntf(" Between o and 255 please ») • 
^ scanf ("%d«,&our_code); f ) . 

h feed back the binary code which will be embedded in the image 

printf ("\nThe binary sequence ") • 
for(i=0;i<BITS;i++) { 
j = 128 » i; 

if( our^code & j)printf ("i") ; 
^ else printf ("0") ; 

printf{. (%d) will be embedded on the image\n- , our_code) ; 

/* now generate the individual snow images */ 
get_snow_images ( ) ; *uM3e«> / 
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loop.viBualO; /* this gives visual feedback on 'tolerable' 
noise level */ 

printf ("\nwe're now to the simulated suspect... \n-) ; 
loop^simulation { > ; 

close_everything{) ? 
ret\im(0) ; 

} 
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Claims 

1. A method of identification coding an input signal so as to pamit its later 
identification, the method including the steps: 

modulating a noise signal with a code number to produce a signature signal; and 
modulating the input signal with the signature sigmU to produce an identification 
coded output signal; 

wherein the coded output sigmd can be analyzed to discern the code number with 
M^db it was modulated. 

2. A method of identification coding an input sigmJ so as to produce an 
encoded output signal, the input signal being a quantized signal having inherent noise, said signal 
coirespondmg to aural or visual information, the identification coding of the output signal 
preserving the corresponding aural/visual infbnnation without hmnan-perceptible degradation, the 
Identification coding pennitting later identification of the output signal, the method including 
modulating a noise signal with a code numb« to produce a signature sign^, addmg the signature 
sigmd to the input signal- to produce an identification coded output signal, the signature signal 
havmg an amplitude below a threshold of human aural/visual perceptibility when added to the 
mm Signal, the addmg step efffectmg distribution of the sigmtture signal throughout the entirety of 
the output signal. 

3. A"««hodofdataprocessmgmctodmg:providmgadigitalcarriersignd.and 
modulating the digital carrier signal to impereeptibly embed an identification signal thereon, the 
method chamcterized by: compressing the modukted digital canier signal with lossy data 
compression to produce a compressed signal, decompressing the compressed signal, and discemmg 
the embedded identification signal from the decompressed signal, wherem the lossy data 
compression does not preclude recovery of the embedded identification signal. 

4. An apparatus for encoding a sampled input signal, the sampled input signal 
havmg mherent noise, the apparatus including an mput terminal, a digital noise souree. stoi^e for 
an Identification code word, means for mamtammg a pomter to a bit of the identification code 
word, an adder, and an output tennmal. the mput temunal bemg coupled to a first input of the 
adder, the noise souree being «,upled to a second mput of the adder, the pomter providing said bft 
of the Identification code word to a control input of the adder, an output of the adder being 
coupled to flie output terminal. 

5. The apparatus of claim 4 which further includes a look-up table, a first scaler 
a second scaler, a scale control device, and a memoiy. the look-up table having an input coupled ' 
to the input termmal. one of said scalers having a control input coupled to an output of the look- 
up table, the other of said scalers havmg a control input coupled to the scale control device said 
scalers being serially interposed between the noise somx=e and the adder, the memory having an 
mput coupled to a location between the noise souree and the second input of the adder 
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6. A method of identification coding a sampled input signal, the sampled mput 
signal having inherent noise, flie mcfliod comprising: 
providing an N-bit code nmnber; 
fe eadi of a plurality of samples of the input signal: 

(a) providing a sample of a time- or spatially-vaiying modulation signal; 

(b) selecting one bit of the N-bit code number, and 

(c) if said bit has a first vahie, adding the modulation signal sample to 
the sample of the input signal, yielding a sample of an identification coded 
output signal. 

7. The method of claim 6 whiA includes performing steps (a) -(c) for each 

sample of the tapat signal 

8. The method of claim 6 which further includes storing, for later use. data from 

which fte modulation agnal sample can be reconstructed. 

9. The method of claim 6 which includes generating the time-varying 
modulation signal sample by providing a pseudo-random number and weighting said nmnber with 
a scaling factor, said scaling factor being a function of the input signal sample. 

10. The method of dann 6 which inctodes selecting the one bit of the W,iX 
code number by cycling through the number, advancing one bit position for each successive 

sample of the iiq>ut signal. 

11. Themefliodofclaimewhidifurflierindudes: 

if said selected bit of the N-bit code number has a second value, subtracting the 
modulation signal sample from the sample of the input sigmO, yielding a sample of the 

identification coded output signal. 

12. Storage medium having stored thereon a signal processed in accordance with 

the method of claim 6. 

13. Tbe invention of claim 12 in which the storage medhmi is a magnetic 



14. TTie invention of claim 12 in which the storage medimn is a printed 

15. Tbe invention of claim 12 in which the storage medhnn is a compact disk 



medium. 
mediuuL 

(CD). . , r 

16. A method of identification coding each of a plurality of samples of a 

sampled input signal, the input signal having inherent noise, characterized by: 

using the sample of the input signal to obtain a scaling fector miiquely associated 

thereto; 

weighting a signature dawm in accordance with said scalmg fiictor. and 
modulating the sample of the input signal m accordance with said weighted 

signature datum. 
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17. ^en-ethodofclaim 16 in which the scaling factors ma«asen.<m^ 
with the values of the input signal samples with which they are associated. 

18. THe method of claim 16 in which a four-fold increase in a value of the 
««npled input signal con^onds to appn»dmately a two-fold increase in a value of the s«ding 
tactor associated therewiflt 

19. A method of processing a sampled input signal with an N-bit signature word 
to produce an identification^oded outpm signal, the sampled input signal having inherent noise 
^^cin the complete N-bit signature finds expression M times in an excerpt of the identification- 
coded omput signal having a length of M'N samples, for some vatoe of M greater than one 

20. The method of daim 19 characterized by processing each sample of the 
input signal in accordance with at least part of the signature word. 

21. In a method of processing a source signal that inchKles a nmnber of 
elements, each with an associated value, an imprevement characterized by altering the souree 
sii^m acconbnce with an embedded signal so as to encode an identification code therein, the 
embedded and altered signals each including a nmnber of elements, each with an aviated value 
wherem an element of .he altered ^alhasav 

» both the souree and embedded signals, and in which the ide.«ification code and certam pseudo- 
nndom reference data are used to generate the embedded signal, the association between the 
embedded signal and the identificadon code being midiscemible without availability of the 
reference data. 

22. In a method of processmg a source signal that inchides a number of 

elements, each with an associated value, an hnprovement characterized by: 

providing an N bit digital identification code, each bit having a "l" or "O" value- 
providing N different reference signals, one being associated with each bit 

position in the digital identification code; 

for which the corresponding bit position mft^ 
Identification code has a -1 - vahie. thereby prefacing an embedded signal; 

altering the souree signal in accordance with the embedded signal so as to encode 

an identification code therein; 

the embedded and altered signals each includmg a nmnber of elements, each with 
an associated value, wherein an element of the altered signal has a value diferent than that of 
corresponding elements in both the source and embedded signals. 
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